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Methods for Obtaining Plant Varieties 

TECHNICAL FIELD 

The present invention relates to nucleotide sequences which encode polypeptides 
involved in the DNA mismatch repair systems of plants, and to the polypeptides encoded 
5 by those nucleotide sequences. The invention also relates to nucleotide sequences and 
polypeptide sequences for use in altering the DNA mismatch repair system in plants. The 
invention also relates to a process for altering the DNA mismatch repair system of a plant 
cell, to a process for increasing genetic variations in plants and to processes for obtaining 
plants having a desired characteristic. 

, o BACKGROUND OF THE INVENTION 

Plant breeding essentially relies on and makes use of genetic variation which occurs 
naturally within and between members of a family, a genus, a species or a subspecies. 
Another source of genetic variation is the introduction of genes from other organisms 
which may or may not be related to the host plant. 

is Allelic loci or non-allelic genes which constitute or contribute to desired quantitative 

(e.g. growth performance, yield, etc.) or qualitative (e.g. deposition, content and 
composition of seed storage products; pathogen resistance genes: etc.) traits that are 
absent, incomplete or inefficient in a species or subspecies of interest are typically 
introduced by the plant breeder from other species or subspecies, or de novo. This 

20 introduction is often done by crossing, provided that the species to be crossed are sexually 
compatible. Other means of introducing genomes, individual chromosomes or genes into 
plant cells or plants are well known in the an. They include cell fusion, chemically aided 
transfection (Schocher et al., 1986. Biotechnology 4: 1093) and ballistic (McCabe et al., 
1988, Biotechnology 6: 923), microinjection (Neuhaus et al., 1987, TAG 75: 30), 

25 electroporation of protoplasts (Chupeau et al., 1989, Biotechnology 7: 53) or microbial 
transformation methods such as Agrobacterium mediated transformation (Horsch et al., 
1985, Science 227: 1229; Hiei et al., 1996, Biotechnology 14: 745). 

However, when a foreign genome, chromosome or gene is introduced into a plant, it 
will often segregate in subsequent generations from the genome of the recipient plant or 
30 plant cell during mitotic and meiotic cell divisions and, in consequence, become lost from 
the host plant or plant cell into which it had been introduced. Occasionally, however, the 
introduced genome, chromosome or gene physically combines entirely or in part with the 
genome, chromosome or gene of the host plant or plant cell in a process which is called 
recombination. 

35 Recombination involves the exchange of covalent linkages between DNA molecules 
in regions of identical or similar sequence. It is referred to here as homologous 
recombination if donor and recipient DNA are identical or nearly identical (at least 99% 
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base sequence identity), and as homeologous recombination if donor and recipient DNA 
are not identical but are similar (less than 99% base sequence identity). 

The ability of two genomes, chromosomes or genes to recombine is known to 
depend largely on the evolutionary relation between them and thus on the degree of 
5 sequence similarity between the two DNA molecules. Whereas homologous 
recombination is frequently observed during mitosis and meiosis, homeologous 
recombination is rarely or never seen. 

From a breeder's perspective, the limits within which homologous recombination 
occurs, therefore, define a genetic barrier between species, varieties or lines, in contrast 

io to homeologous recombination which can break this barrier. Homeologous recombination 
is thus of great importance for plant breeding. Accordingly there is a need for a process 
for enhancing the frequency of homeologous recombination in plants. In particular, there 
is a need for a process of increasing homeologous recombination to significantly shorten 
the length of breeding programs by reducing the number of crosses required to obtain an 

is otherwise rare recombination event. 

At least in Escherichia coli. homologous and homeologous recombination are known 
to share a common pathway that requires among others the proteins RecA, RecB, RecC. 
RecD and makes use of the SOS induced RuvA and RuvB, respectively. It has been 
suggested that mating induced recombination follows the Double-Strand Break Repair 

20 model (Szostak et al., 1983, Cell 33, 25-35), which is widely used to describe genetic 
recombination in eukaryotes. Following the alignment of homologous or homeologous 
DNA double helices the RecA protein mediates an exchange of a single DNA strand from 
the donor helix to the aligned recipient DNA helix. The incoming strand screens the 
recipient helix for sequence complementarity, seeking to form a heteroduplex by hydrogen 

25 bonding the complementary strand. The displaced homologous or homeologous strand of 
the recipient helix is guided into the donor helix where it base pairs with its counterpart 
strand to form a second heteroduplex. The resulting branch point then migrates along the 
aligned chromosomes thereby elongating and thus stabilising the initial heteroduplexes. 
Single stranded gaps (if present) are closed by DNA synthesis. The strand cross overs 

30 (Holliday junction) are eventually resolved enzymatically to yield the recombination 
products. 

Although in wild type E. coli homologous and homeologous recombination are thus 
mechanistically similar if not identical, homologous recombination in conjugational 
crosses E. coli x £. coli occurs five orders of magnitude more frequently than 
35 homeologous recombination in conjugational crosses E. coli x S. typhimurium (Matic et 
al. 1995; Cell 80, 507-515). The imbalance in favour of homologous recombination was 
shown to be caused largely by the bacterial MisMatch Repair (MMR) system since its 
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inaciivaiion increased the frequency ot homeologous recombination in E. coli up to 1000 
told (Ruyssiguier et al. 1989. Nature 342. 396-401). 

In £. < oli . the MMR system (reviewed by Modrich 1991. Annual Rev Genetics 25. 
229-253) is composed of only three proteins known as MutS. MutL and MutH. MutS 
5 recognizes and binds to base pair mismatches. MutL then forms a stable complex with 
mismatch bound MutS. This protein complex now activates the MutH intrinsic single 
stranded endonuclease which nicks the strand containing the misplaced base and thereby 
prepares the template for DNA repair enzymes. 

During recombination. MMR components inhibit homeologous recombination. In 
to vitro experiments demonstrated that MutS in complex with MutL binds to mismatches at 
the recombination branch point and physically blocks RecA mediated strand exchange and 
heteroduplex formation (Worth et al.. 1994; PNAS 91. 3238-3241). Interestingly, the 
SOS dependent RuvAB mediated branch migration is insensitive to MutS/MutL, 
explaining the observed slight increase in SOS dependent homeologous recombination, 
is Homeologous mating even induces the SOS response, thereby taking advantage of RuvAB 
• induction (Matic et al. 1995. Cell 80. 507-515). 

The MMR system thus appears to be a genetic guardian over genome stability in E. 
coli. In this role it essentially determines the extent to which genetic isolation, that is, 
speciation. occurs. The diminished sensitivity of the SOS system to MMR, however, 
20 allows (within limits) for rapid genomic changes^ times of stress, providing the means 
for fast adaptation to altered environmental conditions and thus contributing to intraspecies 
genetic variation and species evolution. 

The important role of MMR in preserving genomic integrity has been established 
also in certain eukaryotes. In its efficiency, the human MMR. for example, may even 
25 counteract potential gene therapy tools such as triple-helix forming oligonucleotides 
including RNA-DNA hybrid molecules (Havre et al.. 1993, J. Virology 67: 7234-7331; 
Wang et al.. 1995, Mol. Cell. Biol. 15: 1759-1768; Kotani et al., 1996, Mol. Gen. 
Genetics 250: 626-634; Cole-Strauss et al., 1996, Science 273: 1387-1389). Such 
oligonucleotides are designed to introduce single base changes into selected DNA target 
30 sequences in order to inactivate for example cancer genes or to restore their normal 
function. The resulting base mismatches however are recognised by the mismatch repair 
system which then directs removal of the mismatched base, thereby reducing the 
efficiency of oligonucleotide induced site-specific mutagenesis. 

To date, two families of related genes, homologous to the bacterial MutS and MutL 
35 eenes have been identified or isolated in yeast and mammals (recent reviews by Arnheim 
and Shibata. 1997. Curr. Opinion Genet. Dev. 7, 364-370; Modrich and Lahue, 1996, 
Annual Rev. Biochem. 65. 101-133; Umar and Kunkel. 1996. Eur. J. Biochem. 238, 297- 
307). Biochemical and genetic analysis indicated that eukaryotic MutS homologs (MSH) 
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and MutL homologs (MLH. PMS). respectively, fulfil similar protein functions as their 
bacterial counterparts. Their relative abundance, however, could reflect different 
mismatch specificity and/or specialisation for different tissues or organelles or 
developmental processes such as mitotic versus meiotic recombination. 

5 To date, six different genes homologous to MutS have been isolated in yeast 
iyMSH). and their homologs have been found in mouse (mMSH) and human (hMSH), 
respectively. Encoded proteins yMSH2, yMSH3 and yMSH6 appear to be the main MutS 
homologs involved in MMR during mitosis and meiosis in yeast, where the 
complementary proteins MSH3 and MSH6 alternatively associate with MSH2 to recognise 
10 different mismatch substrates (Masischky et a!.. 1996, Genes Dev. 10, 407-420). Similar 
protein interactions have been demonstrated for the human homologs hMSH2, hMSH3 
and hMSH6 (Acharya et al.. 1996, PNAS 93. 13629-13634). 

MutL homologs (MLH and PMS), recently reviewed by Modrich and Lahue (1996, 
Annual Rev. Biochem. 65. 101-133) have so far been found in yeast (yMLHl and 

is yPMSl), mouse (mPMS2) and human (hMLHl, hPMSi and hPMS2). The hPMS2 is a 
member of a family of at least 7 genes (Horii et a!.. 1994, Biochem. Biophys. Res. 
Commun. 204, 1257-1264) and its gene product is most closely related to yPMSl. Prolla 
et al. (1994, Science 265, 1091-1093) presented evidence for yPMSl and yMLHl to 
physically associate with each other and, together, to interact with the MutS homolog 

20 yMSH2 to form a ternary complex involved in mismatch substrate binding. 

However, while medical interest in mismatch repair has prompted extensive research 
on MMR in bacteria, yeast and mammals, MMR genes have not been isolated from higher 
plants prior to the present invention and no attempts to adjust the plant MMR to plant 
breeding needs have been reported. 

25 SUMMARY OF THE INVENTION 

According to a first embodiment of the invention, there is provided an isolated and 
purified DNA molecule comprising a polynucleotide sequence encoding a polypeptide 
functionally involved in the DNA mismatch repair system of a plant. In one form of this 
embodiment, the invention provides an isolated and purified DNA molecule comprising a 

30 polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human. More particularly, the invention provides 
polynucleotide sequences encoding polypeptides which are homologous to the mismatch 
repair polypeptides MSH3 and MSH6 of Saccharomyces cerevisiae. Still more 
particularly, the invention provides the coding sequences of the genes AtMSH3 and 

35 AtMSH6 of Arabidopsis thaliana. as defined hereinbelow, and polynucleotide sequences 
encoding polypeptides which are homologous to polypeptides encoded by AtMSH3 and 
AtMSH6. 
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According to a second embodiment of the invention, there is provided an isolated 
and purified polypeptide functionally involved in the DNA mismatch repair system of a 
plant, for example a polypeptide which is homologous to a mismatch repair polypeptide of 
a yeast or of a human such as a polypeptide encoded by the genes AiMSH3 or AtMSH6 of 
5 Arabidopsis thaliana, as defined hereinbelow. 

According to a third embodiment of the invention, there is provided an isolated and 
purified DNA molecule comprising a polynucleotide sequence selected from the group 
consisting of (i) a sequence encoding a polynucleotide which is capable of interfering with 
the expression of a plant polynucleotide sequence encoding a polypeptide which is 
io homologous to a mismatch repair polypeptide of a yeast or of a human and thereby 
disabling said plant polynucleotide sequence; and (ii) a sequence encoding a polypeptide 
* capable of disrupting the DNA mismatch repair system of a plant. 

According to a fourth embodiment of the invention there is provided a chimeric gene 
comprising a DNA sequence selected from the group consisting of (i) a sequence encoding 

15 a polynucleotide which is capable of interfering with the expression of a plant 
polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human and thereby disabling said plant polynucleotide 
sequence, and (ii) a sequence encoding a polypeptide capable of disrupting the DNA 
mismatch repair system of a plant: together with at least one regulation element capable of 

20 functioning in a plant cell. Examples of such regulation elements include constitutive, 
inducible, tissue type specific and cell type specific promoters such as 35S, NOS, PRla, 
AoPRl and DMC1. Typically, a chimeric gene of the fourth embodiment will also 
include at least one terminator sequence, more typically exactly one terminator sequence. 

In the third and fourth embodiments, said interference, by said polynucleotide 
25 sequence, with the expression of a plant polynucleotide sequence encoding a polypeptide 
which is homologous to a mismatch repair peptide of a yeast or a human typically occurs 
by hybridisation or by co-suppression. 

According to a fifth embodiment of the invention there is provided a plasmid or 
vector comprising a chimeric gene of the fourth embodiment. A vector of the fifth 
30 embodiment may be, for example, a viral vector or a bacterial vector. 

According to a sixth embodiment of the invention, there is provided a plant cell 
stably transformed, transfected or electroporated with a plasmid or vector of the fifth 
embodiment. 

According to seventh embodiment of the invention, there is provided a plant 
35 comprising a cell of the sixth embodiment. 

According to an eighth embodiment of the invention, there is provided a process for 
at least partially inactivating a DNA mismatch repair system of a plant cell, comprising 
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transforming or transfecting said plant cell with a DNA sequence of the third embodiment 
or a chimeric gene of the fourth embodiment or a plasmid or vector of the fifth 
embodiment, and causing said DNA sequence to express said polynucleotide or said 
polypeptide. 

5 According to a ninth embodiment of the invention, there is provided a process for 

increasing genetic variation in a plant comprising obtaining a hybrid plant from a first 
plant and a second plant, or cells thereof, said first and second plants being genetically 
different; altering the mismatch repair system in said hybrid plant; permitting said hybrid 
plant to self-fertilise and produce offspring plants; and screening said offspring plants for 

io plants in which homeologous recombination has occurred. For example, homeologous 
recombination may be evidenced by new genetic linkage of a desired characteristic trait or 
of a gene which contributes to a desired characteristic trait. 

According to a tenth embodiment of the invention there is provided a process for 
obtaining a plant having a desired characteristic, comprising altering the mismatch repair 
is system in a plant, cell or plurality of cells of a plant which does not have said desired 
characteristic, permitting mutations to persist in said cells to produce mutated plant cells, 
deriving plants from said mutated plant cells, and screening said plants for a plant having 
said desired characteristic. 

In a preferred form of the ninth and tenth embodiments of the invention, the step of 
20 altering the mismatch repair system comprises introducing into said hybrid plant, plant, 
cell or cells a chimeric gene of the fourth embodiment and permitting the chimeric gene to 
express a polynucleotide which is capable of interfering with the expression of a plant 
polynucleotide sequence in a mismatch repair gene of the hybrid plant, plant, cell or cells, 
or a polypeptide capable of disrupting the DNA mismatch repair system of the hybrid 
25 plant or cells. 

In other embodiments, the invention provides (a) an oligonucleotide capable of 
hybridising at 45°C under standard PCR conditions to a DNA molecule of the first 
embodiment; (b) an oligonucleotide capable of hybridising at 45°C under standard PCR 
conditions to the DNA of SEQ ID NO: 18 and (c) an oligonucleotide capable of 
30 hybridising at 45°C under standard PCR conditions to the DNA of SEQ ID NO:30; with 
the proviso that the oligonucleotide of (a), (b) and (c) is other than SEQ ID NO:l or SEQ 
IDNO:2. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure I provides a diagrammatic representation of the primer sequences used to 
35 isolate AtMSH3. 

Figure 2 is a plasmid map of clone 52, showing restriction enzyme cleavage sites in 
the 5' half of the full-length cDNA for AtMSH3. 
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Figure 3 is a plasmid map of clone 13. showing restriction enzyme cleavage sites in 
the 3' half of the full-length cDN A for AtMSH3. 

Figure 4 is a sequence listing of the coding sequence of AtMSH3, together with a 
deduced sequence of the encoded polypeptide. 
5 Figure 5 is a protein alignment of yeast (Saccharomyces cerevisiae) and Arabidopsis 

thaliana MSH3 protein. 

Figure 6 provides a diagrammatic representation of the primer sequences used to 

isolate AtMSH6. 

Figure 7 is a plasmid map of clone 43, showing restriction enzyme cleavage sites in 
io the 5' half of the full-length cDNA for AtMSH6. 

Figure 8 is a plasmid map of clone 62-. showing restriction enzyme cleavage sites in 
the 3' half of the full-length cDNA for AtMSH6. 

Figure 9 is a sequence listing of the coding sequence of AtMSH6, together with a 
deduced sequence of the encoded polypeptide, 
is Ficure 10 is a protein alignment of yeast (Saccharomyces cerevisiae) and 

Arabidopsis thaliana MSH6 protein. 

Ficure 11 is a genomic sequence listing of AtMSH6. 
Figure 12 is a plasmid map of plasmid pPF13. 
Figure 13 is a plasmid map of plasmid pPF14. 
20 Figure 14 is a plasmid map of plasmid pCWl 86. 

Figure 15 is a plasmid map of plasmid pCW187. 
Figure 1 6 is a plasmid map of plasmid pPF66. 
Figure 1 7 is a plasmid map of plasmid pPF57. 

Figure 18 is a diagrammatic representation of an antisense gene construction for use ; 
25 in homeologous meiotic recombination. 

Figure 19 is a plasmid map of plasmid p3243. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is based on the inventors' discovery that there exist in higher 
plants genes which are homologous to MMR genes in E. coli, and to MMR genes in 

30 yeasts and humans. 

Thus, the inventors have identified genes, herein designated AtMSH3 and AtMSH6, 
of the plant Arabidopsis thaliana which encode the proteins AtMSH3 and AtMSH6. 
These plant proteins are homologous to yMSH3 and yMSH6, respectively. The present 
inventors have isolated cDNAs encoding the proteins AtMSH3 and AtMSH6 and have 

as isolated the complete gene encoding AtMSH6. Given the teaching herein, other genes 
(for example AtMSH2, and genes of other plants) may be obtained which are involved in 
DNA mismatch repair in plants, including other genes which encode polypeptides 
homologous to MMR proteins of yeasts or humans, such as genes which encode 
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polypeptides homologous to yeast MSH2. MLH1 or PMS2. or to human MLHL PMS1 or 
PMS2. For example, given the teaching herein, genes of members of the Brassicaceae 
family or of other unrelated families, for example the Poaceue. the Solanaceae. the 
Asteraceae, the Malvaceae, the Fabaceae, the Linaceae. the Canabinaceae* the Dauaceae 
s and the Cucurbitaceae family, and which encode polypeptides homologous to MMR 
proteins of yeasts or humans may be obtained. 

Examples of plants whose genes encoding polypeptides homologous to MMR 
proteins of yeasts or humans may be obtained given the teaching herein include maize, 
wheat, oats, barley, rice, tomato, potato, tobacco, capsicum, sunflower, lettuce, 
10 artichoke, safflower, cotton, okra, beans of many kinds including soybean, peas, melon, 
squash, cucumber, oilseed rape, broccoli, cauliflower, cabbage, flax, hemp, hops and 
carrot. 

Within the meaning of the present invention, a first polypeptide is defined as 
homologous to a second polypeptide if the amino acid sequence of the first polypeptide 
is exhibits a similarity of at least 50% on the polypeptide level to the amino acid sequence of 
the second polypeptide. 

A procedure which may be followed to obtain genes AtMSH3 and ArMSH6 is 
described in Example 1. Essentially the same technique may be applied to obtain other 
mismatch repair genes of Arabidopsis thaliana, and essentially the same technique as 

20 exemplified herein may be applied to cDNA obtained by reverse transcription of RNA 
from other plants. Alternatively, given the sequence information disclosed herein, other 
degenerate oligonucleotide primers, especially oligonucleotides of the invention which are 
capable of hybridising at 45°C under standard PCR conditions (such as the conditions 
described in Example 1 using primers UPMU and DOMU) to AtMSH3 and/or AtMSH6 

25 may be designed and obtained for use in isolating sequences of plant mismatch repair 
genes which are homologous to AtMSH3 or AtMSH6, from other plants. Similarly, 
oligonucleotides of the invention which are capable of hybridising at 45°C under standard 
PCR conditions to plant mismatch repair genes of plants other than Arabidopsis thaliana 
also fall within the scope of the present invention and may be utilised to obtain mismatch 

30 repair genes of still other plants. Typically, such oligonucleotides are capable of 
hybridising at 45°C under standard PCR conditions to a DNA molecule which encodes a 
polypeptide which is homologous to a mismatch repair polypeptide of a yeast or a human. 
The temperature at which oligonucleotides of the invention hybridise to AtMSH3 and/or 
AtMSH6, or to plant mismatch repair genes of plants other than Arabidopsis thaliana, or 

as to DNA molecules which encode polypeptides which are homologous to a mismatch repair 
polypeptide of a yeast or a human may be higher than 45°C, for example at least 50°C, or 
at least 55°C, or at least 60°C or as high as 65°C. 
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The successful gene isolation disclosed herein demonstrates for the first time the 
existence of MMR in higher plants and indicates the presence of other plant MMR genes. 
For example, genes encoding the plant homologs of MSH1. MSH2. MSH4. MSH5. 
PMSl. PMS2 and MLH1 may be identified given the teaching herein. Such genes, as 
5 well as those specifically described herein, separately or in combination, are useful in 
manipulating the plant MMR for plant breeding purposes. Thus, for example, the plant 
MMR may be altered by including in a plant cell a polynucleotide sequence as defined 
herein above with reference to the third embodiment of the invention, and causing the 
polynucleotide sequence to express either a polynucleotide which disables a plant MMR 
to gene, or a polypeptide which disrupts the plant's MMR system. 

The DNA molecule of the third embodiment of the invention includes a 
polynucleotide sequence (herein referred to as a MMR altering gene) which may for 
example encode sense, antisense or ribozyme molecules characterised by sufficient base 
sequence similarity or complementarity to the gene to be altered to permit the antisense or 
,5 ribozyme molecule to hybridise with the plant MMR gene in vivo or to permit the sense 
molecule to participate in co-suppression. Alternatively, the MMR altering gene may 
encode a protein or proteins which interfere with the activity of a plant MMR protein and 
thus disrupt the plant' s MMR system. For example, such encoded proteins may be 
antibodies or other proteins capable of interfering with MMR protein function, such as by 
20 complexing with a protein functionally involved in plant MMR thereby disrupting the 
MMR of the plant. An example of such a protein is the MSH3 protein of Arabidopsis 
thaliana described herein or a protein of another plant which is homologous to the MSH3 
protein of A. thaliana. For instance, overexpression of MSH3 in a plant cell causes 
MSH2 present in the cell to be substantially completely complexed. disrupting the 
25 mismatch repair mechanism or mechanisms in the cell which are functionally dependent 
on the presence of a complex of MSH2 with MSH6. Similarly, mismatch repair 
mechanisms which depend on the presence of a complex of MSH2 and MSH3 may be 
disrupted by the overexpression of MSH6. 

A chimeric gene of the fourth embodiment, incorporating a MMR altering gene, 
30 may be prepared by methods which are known in the art. Similarly, the MMR altering 
gene may be introduced into a plant cell, regenerating tissue or whole plant by techniques 
known in the art as being suitable for plant transformation, or by crossing. Known 
transformation techniques include Agrobacterium tumefaciens or A. rhizogenes mediated 
gene transfer, ballistic and chemical methods, and electroporation of protoplasts. 
35 The MMR altering gene or genes are typically expressed from suitable promoters. 
Suitable promoters may direct constitutive expression, such as the 35S or the NOS 
promoter. Usually, however, the promoter will direct either inducible or tissue specific 
(e.g. callus; embryonic tissue; etc.), cell type specific (e.g. protoplasts; meiocytes: etc.) 
or developmental (e.g. embryo) expression of the altering gene or genes, in order for the 
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MMR system to function in tissue types or cell types, or at developmental stages of the 
plant, in which it is not desirable for the MMR system to be altered. Using such 
promoters, therefore, the activity of a MMR altering gene may be limited to a specific 
stage during plant development or it may be altered by controlling conditions external to 

5 the plant, and the deleterious effects of a permanently disabled or altered DNA mismatch 
repair system in a plant may be avoided. Examples of suitable promoters which are not 
constitutive are known in the art and include inducible promoters such as P/Jla (reviewed 
by Gatz, 1997, Annual Rev. Plant Phys. Plant Mol. Biol. 48: 89), tissue specific 
promoters such as AoPRl (Sabahattin et al., 1993, Biotechnology 11: 218), and cell-type 

io specific promoters such as DA/C1. 

A chimeric gene in accordance with the invention may further be physically linked 
to one or more selection markers such as genes which confer phenotypic traits such as 
herbicide resistance, antibiotic resistance or disease resistance, or which confer some 
other recognisable trait such as male sterility, male fertility, grain size, colour, growth 
is rate, flowering time, ripening time. etc. 

The process of the tenth embodiment of the invention provides, for example, a 
process for generating intraspecies genetic variation by altering the mismatch repair 
system in a plant cell, in regenerating plant tissue or in a whole plant. The plant cell, 
regenerating tissue or whole plant includes and expresses one or more MMR altering 

20 genes which are capable of altering mismatch repair in the plant cell, regenerating tissue 
or whole plant. Alteration of MMR may be achieved, for example, by inactivating the 
genes encoding plant MSH3 and/or plant MSH6. It is preferred to inactivate the plant 
MSH3 and MSH6 encoding genes at the same time and in the same plant cell, 
regenerating tissue or whole plant. Typically in this preferred form of the invention 

25 inactivation of either plant MSH3 or MSH6 alone is insufficient to substantially alter the 
plant's mismatch repair system and only when both MSH3 and MSH6 are inactivated 
simultaneously is the plant's mismatch repair system sufficiently altered to prevent the 
MMR system from recognising base pair mismatches, base insertions or deletions as a 
result of DNA replication errors, DNA damage, or oligonucleotide induced site-specific 

30 mutagenesis. However, in some applications of the invention, inactivation of only one 
gene may also be used to cause genomic instability or increase the efficiency of site- 
specific mutagenesis. 

If desired, the MMR altering gene or genes may later be rendered non-functional or 
ineffective, or may be removed from the genome of the plant cell, regenerating tissue or 
35 whole plant in order to restore mismatch repair in the plant cell, regenerating tissue or 
whole plant. The MMR altering gene or genes may be inactivated by means of known 
gene inactivation tools, such as ribozymes, or may be removed from the genome using 
gene elimination systems known in the art, such as CRE/LOX. It is preferred to render 
two genes, whose gene products combine to incapacitate MMR, ineffective by separating 
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the alierina senes throueh segregation. Therefore, in a preferred embodiment of the 
invention a first plant cell or plant is generated in which only plant MSH3 is incapacitated, 
and a second plant cell or plant is generated in which only plant MSH6 is incapacitated. 
The combination of both genomes, for example by crossing, then produces significant 
5 MMR deficiency in those cells or plants which have inherited both altering genes. It the 
altering genes are expressed from unlinked loci, gene segregation restores MMR activity 
in the progeny of the cells or plants. 

In a process of the ninth embodiment of this invention, homeologous recombination 
is enhanced between different genomes, chromosomes or genes in plant cells or plants by 
10 altering MMR in said plant cells or plants. Such genomes, chromosomes or genes are 
characterised in that they originate from different plant families, genera, species, 
subspecie's, plant varieties or lines. Hybrid plant cells or hybrid plants may be produced 
by crossing, by cell fusion or by other techniques known in the art. These plant cells or 
plants are farther characterised by expressing one or more genes that are capable of 
i s altering mismatch repair in the plant cell or plants. 

In the process of the ninth embodiment, the homeologous recombination is typically 
for the purpose of introducing a desired characteristic in the hybrid plant. In this typical 
application of the process of the ninth embodiment, and in the process of the tenth 
embodiment the desired characterise may be any characteristic which is of value to the 
20 plant breeder. Examples of such characteristics are well known in the art and include 
altered composition or quality of leaf or seed derived storage products (e.g. oil, starch, 
protein), altered composition or quality of cell walls (e.g. decrease in lignin content), 
altered growth rate, prolonged flowering, increased plant yield or grain yield, altered 
plant morphology, resistence to pathogens, tolerance to or improved performance under 
25 environmental stresses of various kinds, etc. 

In a preferred form of the tenth embodiment, an MMR altering gene is co- 
introduced along with the homeologous genome, chromosome or gene of another plant 
cell or plant into an MMR proficient plant cell or MMR proficient plant to produce a 
hybrid plant cell or hybrid plant in which homeologous recombination can occur. 
30 Suitably, the MMR proficient plant cell or MMR proficient plant may also include an 
MMR altering gene. For example a gene capable of inactivating plant MSH3 may be co- 
introduced along with the homeologous genome, chromosome or gene of another plant 
cell or plant into an MMR proficient plant cell or MMR proficient plant in which MSH6 
is inactivated. A resultant hybrid plant in which homeologous recombination occurs will 
35 include both the MSH3 and MSH6 altering genes and its MMR system will therefore be 
inactivated. 

In this form of the invention, if hybrid plants are to be produced by crossing, the 
MMR altering gene preferably originates from the male parent, thus ensuring that the 
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MMR altering gene is always introduced and is not present in the recipient cell. That is, 
the MMR of the recipient cell, prior to introduction of the MMR altering gene, is 
typically proficient. Alternatively, if an MMR altering gene is present in a recipient cell it 
may be ineffective or inefficient on its own. or it may be linked to an inducible or tissue 
5 specific or cell type specific promoter which only renders the MMR altering gene active 
under limited conditions. 

Thus, in a preferred form of the process of the ninth embodiment, the MMR system 
of the hybrid plant is initially unaltered. In this form of the process, the step of altering 
the mismatch repair system may comprise introducing into the hybrid plant, or cells 
io thereof, a MMR altering gene, such as by Agrobacterium tumefaciens or A. rhizogenes 
mediated gene transfer, ballistic and chemical methods, and electroporation of protoplasts. 

The MMR altering gene or genes are typically expressed from suitable promoters, as 
described above. Preferably, the promoter is transcriptionally active in mitotically and 
meiotically active tissue and/or cells to ensure MMR alteration after chromosome pairing 

15 at mitosis and meiosis, respectively. The preferred timing for MMR alteration is at 
meiosis. because recombinant genomes, chromosomes or genes are directly transmitted to 
the progeny. A suitable meiocyte specific promoter is for example the DMC1 promoter 
from Arabidopsis thaliana ssp. Ler. (Klimyuk and Jones, 1997, Plant J. 11, 1-14). 
However, mitotic homeologous recombination is also a desirable outcome as somatic 

20 recombination events can be transmitted to offspring due to the totipotency of plant cells 
and the lack of predetermined germ cells in plants. 

If desired, the MMR altering gene or genes may later be rendered non-functional or 
ineffective, or may be removed from the hybrid plant or hybrid plant cells, in order to 
restore mismatch repair in the hybrid plant or hybrid plant cells. The MMR altering gene 

25 or genes may be inactivated by means of known gene inactivation tools as described 
herein above. 

EXAMPLES 

Example 1. Cloning of the AtMSH3 and AtMSH6 coding sequences 

Isolation of partial AtMSH3 and AtMSH6 consensus sequences 

30 Degenerate oligonucleotides UPMU (SEQ ID NO: 1 ) and DOMU (SEQ ID NO:2) 

UPMU CTGGATCCACIGGICCIAA(C/T)ATG 
DOMU CTGGATCC(A/G)TA(A/G)TGIGTI(A/G)C(A/G)AA 
were used to isolate AtMSH3 and AtMSH6 sequences by PCR amplification. 

Primers UPMU and DOMU correspond to conserved amino acid sequences of the 
35 proteins MutS (£. coli and S. typhimurium\ HexA (S. pneumoniae). Rep I (mouse) and 
Duel (human). The conserved regions to which they are targeted are TGPNM for UPMU 
(amino acid positions 852-856 for AtMSH6 and 816-820 for AtMSH3) FATHY or FVTHY 
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for DOMU (amino acid positions 064-968 tor AtMSH6 and 928-932 for AiMSH3. 
respectively.. These primers have been used to isolate MSH2 and MSHI from yeast 
( Recnan and Kolodner. Genetics 1 32: 963-973 ( 1 992)) and MSH2 from Xenopm and mouse 
(Varlet et al,. Nuc. Acids Res. 22:5723-5728 (1994)). 
5 Template single strand cDNA was produced by reverse transcription of 2 ug total 
RNA from a cell suspension culture of Arabidopsis thaliana ecotype Columbia (Axelos et 
at. 1989. Mol. Gen. Genetics 219: 106-112). The PCR reaction was performed under the 
following conditions in a final volume of lOOul: 0.2mM dNTP, luM each primer, 1XPCR 
buffer, lu Taq DNA polymerase (Appligene) in the presence of template cDNA. PCR 
,o parameters were 5 minutes at 94°C, followed by 30 cycles of 40 seconds at 95°C. 90 
seconds at 45°C, 1 minute at 72°C. The amplification products were cloned into pGEM-T 
vector (Promega) and sequenced. Two different clones were isolated, S5 (350bp) was 
homologous to MSH3, S8 (327bp) was homologous to MSH6. Complete cDNA sequences 
were then isolated according to the Marathon cDNA amplification kit procedure (Clontech). 
, 5 In suramarv. this procedure involves producing double stranded cDNA by reverse 
transcription of 2ug polyA^ RNA from the cell suspension culture of Arabidopsis. 
Adaptors are ligated on each side of the cDNA. The ligated cDNA is used as a template for 
5" and 3' RACE PCR reactions in the presence of primers that are specific for the adaptor on 
one side (API and AP2), and specific for the targeted gene on the other side. A 5' and a 3* 
20 fragment that overlap are thus produced for each gene. The complete gene coding sequence 
can be reconstituted taking advantage of a unique restriction site, if available, in the 
overlapping region. Specific details of this procedure as it was used to isolate At\1SH3 and 
AtMSH6 coding regions, are as follows. 
Isolation of AtMSH3 co mplete coding sequence 
25 From the sequence of clone S5, primer 636 ( SEQ ID NO:3) was designed: 

636 TGCTAGTGCCTCTTGCAAGCTCAT. 
Primer API (SEQ ID NO:4) is complementary to a portion of an adaptor sequence 
which had been ligated to the 5' and 3' ends of Arabidopsis cDNA: 

API CCATCCTAATACGACTCACTATAGGGC. 
30 PCR performed on the ligated cDNA with primers 636 and API for the 5' RACE PCR 
was followed by a second round of amplification with the nested primers AP2 (SEQ ID 

NO:5) and S525 (SEQ ID NO:6) 

AP2 ACTCACTATAGGGCTCGAGCGGC 
S525 AGGTTCTGATTATGTGTGACGCTTTACTTA 
35 (the latter was also designed to correspond to a part of the sequence of clone S5) and 
produced a 2720bp DNA fragment. Figure 1 provides a diagrammatic representation of the 
primer sequences used to isolate AtMSH3. Another primer (S51, SEQ ID NO:7) 
S51 GGATCGGGTACTGGGTTTTGAGTGTGAGG 
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was designed closer lo the 5* border and permitted the determination of 99bp upstream to 
the A T( i initiation codon. For the 3* RACE PCR. a first PCR reaction was performed with 
primers API and 635 (SEQ ID N():8). 

63 5 GC ACGTGCTTG ATGGTGTTTTC AC 

5 followed by a second round of amplification, using the nested primers AP2 and S523 (SEQ 
ID NO:9) 

S523 TCAGACAGTATCCAGCATGGCAGAAGTA 
which produced a DNA fragment of 890bp. Both DNA fragments were subcloned into 
pGEM-T and sequenced. Since PCR amplification using the Expand Long Template PCR 
io System (Boehringer-Mannheim) produced errors in the sequence, new oligonucleotides 
were designed to isolate those sequences again by PCR. but with the high fidelity DNA 
polymerase Pfu. PCR with primers 1 S5 (SEQ ID NO: 10) and S53 (SEQ ID NO:l 1) 
I S5 ATCCCGGGATGGGCAAGCAAAAGCAGCAGACGA 
S53 GACAAAGAGCGAAATGAGGCCCCTTGG 
15 amplified the 1244bp fragment clone 52 (SEQ ID NO: 12. cloned into pUC18/Smal). PCR 
with primers S52 (SEQ ID NO: 1 3) and 2S5 (SEQ ID NO: 14) 

2S5 ATCCCGGGTCAAAATGAACAAGTTGGTTTTAGTC 
S52 GCCACATCTGACTGTTCAAGCCCTCGC 
amplified the 2I04bp clone 13 (SEQ ID NO:15, cloned into pUC18/Smal). The complete 
20 coding sequence of the AtMSH3 gene was reconstructed in pUC18 by ligating the 5' half 
of AtMSH3 (clone 52) to the 3' half of AtMSH3 (clone 13) after digesting with BamHl 
which has a unique cleavage site in the overlapping region of both clones. This 
manipulation yielded plasmid pPF26. The Smal fragment from pPF26 contains the 
complete AtMSH3 coding sequence. The remaining primers referred to in Figure 1 are as 
25 follows: 

S5 1 GGATCGGGTACTGGGTTTTGAGTGTGAGG (SEQ ID NO: 16) 

S525 - AGGTTCTGATTATGTGTGACGCTTTACTTA (SEQ ID NO: 17) 
Figures 2 and 3 provide plasmid maps of clones 52 and 13 respectively, showing 
restriction enzyme cleavage sites. The complete AtMSH3 coding sequence (SEQ ID NO: 18) 
30 is 3246bp long and is shown in Figure 4 together with the deduced sequence (SEQ ID 
NO: 19) of the encoded polypeptide. AtMSH3 is clearly homologous to the yeast and mouse 
MSH3 genes. A sequence alignment of polypeptides encoded by AtMSH3 and that encoded 
by Saccharomyces cerevisiae MSH3 is set out in Figure 5. 

Isolation of the AtMSH6 complete coding sequence and genomic sequences 

35 The same procedure allowed isolation of the AtMSH6 cDNA. Figure 6 provides a 

diagrammatic representation of the primer sequences used to isolate AtMSH6. For the 5* 
RACE PCR, primers 638 (SEQ ID NO:20) and API (SEQ ID NO:4) 

638 TCTCTACCAGGTGACGAAAAACCG 
allowed the amplification of a 2889 DNA fragment. Primer S81 (SEQ ID NO:21) 
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S81 CGTCGCCTTTAGCATCCCCTTCCTTCAC 
helped define the I42bp upstream to the ATG initiation codon. On the 3" side. RACE PCR 
was initially performed with primers S823 (SEQ ID NO:22) and API (SLQ ID NO.4) . 

S823 GCTTGGCGCATCTAATAGAATCATGACAGG 
5 and then with the nested primers 637 (SEQ ID NO:23) and AP2 (SEQ ID NO:5). 

637 GACAGCGTCAGTTCTTCAGAATGC 
to produce a 774bp DNA fragment. As for AiMSH3, those fragments were cloned and 
sequenced. Re-isolation of the DNA sequence using the high fidelity Pfu polymerase and 
newly designed primers 1S8 (SEQ ID NO:24) and S 83 (SEQ ID NO:25) (for the 5" side) led 
10 to a 2182 bp DNA fragment identified as clone 43 (SEQ ID NO:26, cloned in 
pUC18/Smal), and a 1379bp clone identified as clone 62 (SEQ ID NO:27, also cloned in 

pUC18/SmaI). 

1S8 ATCCCGGGATGCAGCGCCAGAGATCGATTTTGT 

2S8 ATCCCGGGTTATTTGGGAACACAGTAAGAGGATT (SEQ ID 

is NO:28) 

582 GCGTTCGATCATCAGCCTCTGTGTTGC (SEQ ID NO:29) 

583 CGCTATCTATGGCTGCTTCGAATTGAG 

Figures 7 and 8 provide plasmid maps of clones 43 and 62 respectively, showing restriction 
enzyme cleavage sites. Clones 43 and 62 were digested by the Xmn\ restriction enzyme for 
20 which a unique site is present in their overlapping region and then ligated. The complete 
AtMSH6 coding sequence (SEQ ID NO:30) is 3330bp long and is shown in Figure 9 
together with the deduced sequence (SEQ ID NO:3I) of the encoded polypeptide. AtMSH6 
is clearly homologous to the yeast and mouse .WS//6genes. A sequence alignment of 
polypeptides encoded by AtMSH6 and that encoded by Saccharomyces cerevisiae MSH6 is 

25 set out in Figure 10. 

An AIMSH6 genomic sequence was also isolated from a genomic DNA library 

constituted after partial Sau3Al digestion of DNA from the Arabidopsis cell suspension. 

8062bp were sequenced that covered the AtMSH6 gene and show colinearity with the 

cDNA. 16 introns are found scattered along the gene. The complete genomic sequence 
30 (SEQ ID NO:98) is shown in Figure 11. 

Example 2. A measure of somatic variation in MMR deficient plants 

Constructs 

Constructs with antisense AtMSH3 or antisense AtMSH6 or both A t MSH3/A tMSH6 
under the control of a single 35S promoter have been inserted into the binary vector 
35 pPZP121 (Hajdukiewicz et aL Plant Mol. Biol. 23. 793-799) between the right and left 
borders of the T-DNA. The pPZP121 plasmid confers chloramphenicol resistance to 
Escherichia coli or Agrobacterium tumefaciens bacteria. The aacCl gene is carried by the 
T-DNA and allows selection of transformed plant cells on gemamycin (Hajdukiewicz et al., 
Plant Mol. Biol. 25. 989-994). For the purpose of expressing antisense constructs, a 35S 
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promoter/terminator cassette with a polylinker was introduced into pPZPl 21 . The 3' ends of 
the considered genes have been chosen since this region seems more efficient for antisense 
inhibition. For AtMSH3 this corresponds to clone 13 (2104bp). for At MS H 6 this 
corresponds to clone 62 (1379bp). Clone 13 comprises 2104bp of the 3' region that were cut 

s off the pUC 18 vector by Sall/Sstl restriction, blunted with T4 DNA polymerase and ligated 
into the T4 DNA polymerase blunted BamWl site of pPZP121/35S, creating clone pPF13. 
The same procedure was followed for the 3* region of AtMSHt clone 62 (1379bp) thus 
creating plasmid pPF14. For the double constructs, the 3' region (from clone 62) of 
AtMSH6 was introduced ahead of the AtMSH3 region into pPF13 creating pCW186 and 

io reciprocally, the 3* region of AtMSH3 (from clone 13) was introduced ahead of AtMSH6 into 
pPF14, creating pCW187. 

These constructs were introduced into the Arabidopsis cells (as described below) of 
wildtype Columbia and of the Columbia tester line. 

An alternative strategy to antisense inhibition of AtMSH6 comes from experiments of 
15 Marra et al. (1998. Proc. Natl. Acad. Sci USA 95. 8568-8573) who show that 
overexpression of functional MSH3 results in depletion of MSH6 protein in human cells. 
This depletion may generate a mismatch repair mutant phenotype. 

For the purpose of overexpressing functional AtMSH3 protein in plant cells, the 
complete MSH3 coding region was excised from pPF26 (example 1) by digestion with 

20 SmaU and was inserted into the Smal site of pCW164. The resulting construct was named 
pPF66. It contains a complete AiMSH3 gene under the control of the 35S promoter inside 
the left (LB) and right (RB) border of the T-DNA. This T-DNA also contains the hpt2 gene 
for gentamycin selection. Plasmid pPF66 was introduced into Arabidopsis cells as 
described below. One cell clone was selected which clearly overexpressed the AtMSH3 

25 gene as shown by Northern analysis. Figures 12-16 provide plasmid maps of plasmids 
pPF13, pPF14, pCW186, pCW187 and pPF66. respectively. 

Construction of tester construct 

For the purpose of Forward Mutagenesis Assays, a tester construct was built 
containing the coding regions for nptlh codA, uidA. All three genes are driven by the 35S 

30 promoter and are terminated by the 35S terminator. This construct was obtained by 
introducing an EcoRl fragment encoding the codA cassette (2.5kb) and a Hindlll fragment 
encoding the uidA (GUS) cassette (2.4kb) into the pPZPlll vector (Hajdukiewicz et 
al M 1994, Plant Mol Biol 23: 793-799) which already contained the nptll expression cassette. 
This new plasmid was named pPF57. NptW is used to select for transformed plant cells. 

35 GUS is used to analyse the degree of gene silencing in the construct (i.e. to identify cell 
lines in which the transgenes are expressed), and codA is used as a marker for forward 
mutagenesis (described below). 
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The plasmid map of pPF57 is provided in Figure 17. 
Plant cell transformation 

The constructs are introduced into Agrohacterium by electroporation. Plant cells are 
then transformed by co-cultivation. A suspension culture of Arabidopsis thuliana cells that 

s has been established by Axelos et al. (1992, Plant Physiol. Biochem. 30, 1-6) may be used. 
One day old freshly subcultured cells are diluted five times into AT medium (Gamborg B5 
medium. 30g/l sucrose. 200ug/l NAA). lOul of saturated Agrobacterium containing the 
transforming T-DNA constructs are added to 10ml diluted cells in a 100ml erlenmeyer. The 
co-cultivation is agitated slowly (80rpm) for 2 days. The cells are then washed 3 times into 

io AT medium and finally resuspended in the same initial volume (10ml). The culture is 
agitated for 3 days to allow expression before plating on selection plates (AT/BactoAgar 
8g/l+gentamycin 50ug/ml). Transformed individual calli are isolated 3 weeks later. 

Tester Strain 

The tester construct on plasmid pPF57 was introduced into Arabidopsis cells of 
,s wildtype Columbia using the transformation protocol described above. Among 10 candidate 
transtbrmants. one cell clone was shown (by Southern analysis) to have a unique T-DNA 
insertion. All three genes were shown to be functional in this cell line as indicated by 
resistance to kanamycin, blue staining in the presence of X-Glu (CUS). and sensitivity to 
5-fluoro-cytosine (codA). 

MMR altering genes (described above) were then introduced individually into the 
tester line and transformed cells are used for analysis of both Microsatellite Instability and 
Forward Mutagenesis. 
Microsatellite analysis 

Microsatellites have been described in Arabidopsis (Bell and Ecker, 1994, Genomics 
25 19. 1 37-144). The present Example is based on a study of instability of microsatellites that 
are polymorphic amongst different ecotypes. DNA is extracted from the transformed calli. 
Specific primers have been defined that are used to amplify the microsatellite sequence. 
One of the two primers is previously P32 labelled by T4 kinase. In case of a polymorphic 
variation, new PCR products appear that do not follow the expected pattern of migration on 
so a polyacrylamide gel. This is a commonly observed feature for MMR deficient cells in 
yeast or mammalian cells. 

In particular, the present Example describes a study on microsatellites ca72 (CA, 8 ), 
ngal72 (GA 29 ). and ATHGENEA(A 39 ), chosen because they belong to the types 
predominantly affected in human mismatch repair deficient tumors. The size ot these 
35 microsatellites is not conserved from one Arabidopsis ecotype to the other. 

Arabidopsis cells which are transformed with an MMR altering gene (above) and 
control cells not expressing the MMR altering gene are allowed to form calli. DNA is 
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rapidly extracted from the calli and is analysed for microsatellite instability as described in 
detail by Bell and Ecker l c >94_ Genomics 137-144. In summary, the relevant 
microsatellite is amplified by PCR using P32 labelled primers. The PCR products are 
separated on a DNA sequencing uel for size determination. Size differences between 
5 microsatellites from transformed and control cells not expressing the MMR altering gene in 
question indicate microsatellite instability as a result of MMR alteration. 

The sequences of primers used for PCR amplification of microsatellites ca72 and 
ngal 72 are included in Table 1 . PCR amplification of microsatellite ATHGENEA made use 
of a forward primer containing the sequence 

i o ACCATGC ATAGCTTAAACTTCTTG (SEQ ID NO:32) 

and of a reverse primer containing the sequence 

ACATAACCACAAATAGGGGTGC (SEQ ID NO:33). 

The amplification for microsatellite ca72 revealed in Columbia control cells (with 
respect to the MMR altering gene) a 248 bp long PCR fragment instead of the published 
is length of 124 bp. DNA sequencing verified this fragment as a CA JK microsatellite. 

Forward mutagenesis assav 

Tester cells transformed with antisense AtMSHS or antisense A1MSH6 or both 
AtMSH3/AiMSH6 are analysed for the stability of the codA gene. The functional codA gene 
confers to sensitivity to 5-fluoro-cytosine (5FC). whereas a gene inactivating mutation in 

20 codA will confer resistance to 5FC. The frequency of resistant cells is therefore a good 
indicator of somatic variation as a direct result of MMR alteration. Variants resistant to 5FC 
are first analysed for GUS activity. If GUS is inactive. 5FC resistance is assumed to be due 
to gene silencing (all three genes are under the 35S promoter). If GUS is active. 5FC 
resistance is assumed to be due to forward mutations that have inactivated codA. PCR is 

25 then performed on the putative codA mutant genes which is then sequenced to confirm the 
presence of forward mutations in codA. 

Besides codA. other marker genes may also be used for the Forward Mutagenesis 
Assay such as the ALS gene (conferring sensitivity to valine or to sulfonylurea; Hervieu and 
Vaucheret, 1996, Mol. Gen. Genet. 251 220-224; Mazur et al. 1987, Plant Physiol. 55 1 1 10- 
30 1117). 

Example 3. Homeologous meiotic recombination in Arabidopsis thaliana 

A. Construction of a meioevte specific gene expression cassette comprising the DMC1 
promoter and the NOS terminator 

(i) The DMC\ promoter may be used as published by Klimyuk and Jones, 1997, 
35 Plant J. 11.1-14). To obtain a more convenient alternative for gene cloning, a 3.3 Kb 
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long subfragment of the DMC\ promoter was obtained by PCR from genomic DNA of 
Arabidopsis thaiuina (ssp. Landsberg erecta "Ler "). 
The PCR was done in three rounds. 

Round One: A 3.7 Kb long product was obtained using the forward primer 

5 DMCIN-A comprising the sequence 

GAAGCGATATTGTTCGTG (SEQ ID NO: 34) 
and the reverse primer DMCIN-B comprising the sequence 

AGATTGCGAGAACATTCC (SEQ ID NO:35). 
The weak amplification product was then used as template for round two and three. 
io Round Two: A 3.1 Kb long . product comprising the promoter and the 5' 
untranslated leader was obtained using forward primer DMCIN-1. which contained the 
sequence 

acgcgtcgacTCAGCTATGAGATTACTCGTG (SEQ ID NO:36) 
and introduced a Sa/I cloning site at the 5' end of the promoter fragment, and reverse 
is primer DMCIN-2 which contained the sequence 

gctctagaTTTCTCGCTCTAAGACTCTCT (SEQ ID NO: 37) 
and introduced a Xbal site at the 3' end of the PCR fragment. 

Round Three: A 0.2 Kb long product comprising the first exon/intron of the DMC\ 
promoter was obtained using forward primer DMCIN-3, which contained the sequence 
2 o gctctagaGCTTCTCTTAAGTAAGTGATTGAT (SEQ ID NO:38) 

and introduced a Xbal site at the 5' end of the PCR fragment, and reverse primer 

DMCIN-4, containing the sequence 

tcccccgggctcgagagatctccatggTTTCTTCAGCTCTATGAATCC (SEQ ID NO:39) 

and introduced at the 3' end of the PCR product restriction sites for Nco\. BgHl. Xhol and 
25 Smal. 

The products obtained in round Two and Three were digested with Xbal and 
subsequently ligated to reconstitute a 3.3 Kb long DMC1 promoter from which the first 
two in-frame ATG start codons were replaced with a unique restriction site for Xbal. 
This promoter can be cloned between . the restriction sites for Sa/I and Smal of p3264, 
30 which contains the Sacl-EcoRI NOS terminator in pBIN19, to yield the entire expression 
cassette in pBIN19. This cassette contains the following cloning sites: Ncoh Bgllh Xhol. 
Smal and (already present on p3264) Kpnl and Sad. 

(ii) Another strategy yielded the following convenient DMC1 promoter. A 1.8 kb 
DNA fragment comprising the 3' terminal pan of the meiocyte specific DMC1 promoter 
35 was isolated by PCR from purified genomic DNA of Arabidopsis thaliana (ssp. Landsberg 
erecta "Ler"). The forward PCR primer (DMCla) contained the sequence 
acgcgtcgacGAATTCGCAAGTGGGG (SEQ ID NO:40) 
and introduced a Sail cloning site at the 5' end of the promoter fragment. The reverse 
PCR primer (DMClb) contained the sequence 
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tecatggagatctcccgggtacCGATTTGCTTCGAGGG (SEQ ID NO:4l > 
introducing a poiylinker region at the 3' end of the promoter fragment. The PCR reaction 
was carried out with VENT DNA Polymerase (NEB) over 25 cycles using the following 
cycling protocol: 1 minute at 94°C, 1 minute at 54°C, 2 minutes at 72°C. 

5 The PCR reaction yielded a blunt ended DNA fragment which was digested with 
restriction enzyme Sail and was cloned into the cleavage sites of restriction enzymes Sail 
and Smal in plasmid p2030, a pUC118 derivative containing the Sacl-EcoRl NOS 
terminator fragment of pBIN121. The cloning yielded plasmid p2031, containing the 
DM CI promoter-poly linker-NOS terminator expression cassette depicted in Figure 18. 

10 B. Construction of an MSH3 antisense gene under the control of the DMC\ promoter 

A 2.1 kb DNA fragment encoding the carboxy terminal pan of AtMSH3 was 
removed from the poiylinker of clone 13 described in Example 1 after (i) digestion with 
Kpnl. (ii) blunting of the DNA ends generated by Kpnl and (iii) digestion with BarriHl. 
The isolated fragment was then cloned in antisense orientation downstream of the DMC1 
is promoter in plasmid p2031. which had been digested with restriction enzymes Smal and 
Bgfll. This cloning yielded plasmid p2033 (Figure 18). 

After digestion of p2033 with £c<?RL a 4.1 kb DNA fragment was recovered 
comprising the DMC\ promoter, the partial MSHi cDNA in antisense orientation with 
respect to the promoter and the NOS terminator. This fragment was cloned into the EcoRl 
20 restriction site of plant transformation vector pNOS-Hyg-SCV to yield plasmid p3242 
(Figure 18). 

C. Construction of a combined MSH6/MSH3 antisense gene under the control of a 
single DMC\ promoter 

A 3.1 kb fragment, encoding in antisense orientation the partial AtMSH6 and AtMSH3 
25 sequences and the 35S terminator, was isolated from pCW186 by digestion with Kpnl. 
The fragment was treated with Klenow enzyme to blunt both ends. It was then cloned into 
the Smal site of plasmid p3243 (a pNOS-Hyg-SCV derivative, illustrated in Figure 19), 
which had been opened to delete the region between the Smal sites. Clones containing the 
fragment in the antisense orientation with respect to the DMCl promoter (described in 
30 A(ii) above) were identified by diagnostic digestion with BamHL The selected construct 
was named p3261 . 

Another practical way of cloning the double antisense gene is as follows. A 1 kb 
DNA fragment encoding the carboxyterminal pan of AtMSH6 is isolated from clone 62 
described in Example 1 after digestion of clone 62 plasmid DNA with BamHl, which 
35 cleaves in the 5 1 poiylinker region flanking the partial cDNA, and with EcoRI, which 
cleaves within the cDNA. The isolated fragment is treated with Klenow enzyme to blunt 
both its ends and is cloned into the recipient plasmid p2033 or p3242. For the purpose of 
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cloning, (he recipient plasmid may be cleaved with either Aval or Ncol and can be blunted 
with Ktemnx enzyme to produce blunt acceptor ends for fragment cloning. This cloning 
yields two opposite orientations of cloned fragment DNA with respect to the DMCl 
promoter. These can be identified by diagnostic digestion with restriction enzymes Seal 
5 or Xmnl in conjunction with Sad. The selected construct contains the DMCl promoter, 
the combined partial cDNAs for AtMSH3 and AtMSH6 (both cloned in antisense 
orientation with respect to the DMCl promoter) and the NOS terminator. If the recipient 
plasmid is p2033. the combined antisense gene under control the single DMCl promoter is 
recovered from the vector after £coRI digestion and is cloned into the EcoRI restriction 
io site of pNOS-Hyg-SCV. 

O Construction of a full-length MSHS sense gene und er control of the DMCl promoter 
for nverexoression of functional MSH 3 protein 

Overexpression of MSH3 protein was shown in human cells (Marra et al., 1998, 
Proc. Natl. Acad. Sci. USA 95. 8568-8573) to complex all available MSH1 protein. This 
15 leaves MSH6 protein without its partner, leading to the degradation of MSH6 protein and 
eventually to a mismatch repair phenotype. 

This phenomenon is exploited to increase homeologous meiotic recombination in 
Arabidopsis as an alternative to antisense inhibition of AtMSH6. For this purpose the full- 
length cDNA encoding AtMSHl is isolated from plasmid pPF66 by digestion with Smal 
20 and is cloned into the Smal site of the DMCl expression cassettes described in A(i). 

F. Selection of Recombination markers on homeologou s chromosomes of Arabidopsis 
rhnlinnn subspecies I^indsbere ere.cta (LerV C olumbia (Col) and C24, respectively 

Efi). Visual recombination markers in Arabidop sis th. subspecies C24: 

Agrobacterium mediated transformation with a T-DNA containing a 35S-GUS gene, 
25 inactivated by insertion of a 35S-Ac transposable element (Finnegan et al.. 1993, Plant 
Mol. Biol. 22, 625-633), had yielded a C24 line in which the T-DNA construct was 
integrated into chromosome 2. Genetic and molecular analysis of this line shows that the 
Ac transposon had excised from its T-DNA locus thereby restoring GUS activity, but had 
re-inserted into the chromosome at a distance of 16.4 cM, where it stayed fixed (due to 
30 disablement of Ac) within the chlorina gene. Insertional inactivation of the chlorina gene 
caused a bleached phenotype in those plants that were homozygous for this mutation. 
Because of the two linked phenotypic markers. chlorina3A:Ac and GUS, this C24 line was 
used in crosses to wildtype Ler for analysis of meiotic homeologous recombination on 
chromosome 2 in conjunction with molecular recombination markers. 

35 E(ii). Visual recombination markers in Arabido psis th. Ler. 

The Ler line NW1 (obtained from NASC. Nottingham. UK) contains one recessive 
visual marker per chromosome, i.e. an-1 on Chr.l, py-1 on Chr.2, gll-1 on Chr.3. cer2-l 
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on Chr.4. and msl-l on Chr.5. This line is used in crosses to wildtype C24 which 
expresses an MMR altering gene tor analysis of meiotic homeologous recombination on 
chromosomes 1-5 in conjunction with molecular recombination markers listed in Table 1. 
Other Ler lines from NASC have several visual markers in close proximity to each 
s other on the same chromosome. When these lines are used for hybrid production, analysis 
of homeologous meiotic recombination may make use entirely of visual recombination 
markers. Particularly suitable for crossing to C24 wildtype that is expressing a MMR 
altering gene are the following Ler lines: 

NW22: relative markers are disl - (4 cM) - ga4 - (1 1 cM) - thl on chromosome 1. 
io NW10: relevant markers are tz-201 - (5 cM) - cer3 on chromosome 5. 
NW134, relevant markers are ag - (4 cM) - ga3 on chromosome 5. 
NW24 (abi3-l) and NW64 (gll-\). When present in the same plant on chromosome 
3. abi3-\ and gll-\ are 13 cM apart. Since this marker combination is not available from 
NASC. we have combined these markers by crossing line NW24 to line NW64. The Fl 
15 offspring were allowed to self-fertilise and to produce F2 seeds. F2 Plants which carry 
both markers as homozygous traits on the same chromosome can be identified firstly, by 
germinating F2 seeds on germination medium containing selective concentrations of 
abscisic acid, and subsequently, by identifying among the abscisic acid resistant plants 
those individuals which show the glabra phenotype. 
20 FiiiH Molecular recombination mark ers in Col. Ler and C24: 

The genome of Arabidopsis thaliana is interspersed with unique base sequences 
arranged as simple tandem repeats. Allelic repeats can vary in length between different 
Arabidopsis subspecies and when amplified by PGR yield diagnostic DNA products of 
different length named Simple Sequence Length Polymorphisms (SSLPs). Many SSLPs 
25 have been genetically mapped and have been assigned to unique chromosome locations on 
the recombinant inbred map (Bell and Ecker. 1994. Genomics 19, 137-144; Lister and 
Deans lines. Weeds World 4i, May 1997). 

In Table 1 are listed 28 mapped and established SSLPs between Ler and Col. A 
number of PCR primer pairs are described herein (SEQ ID NO:42 to SEQ ID NO:97) 
30 which also yielded SSLPs between C24 and Ler (19 SSLPs) or between C24 and Col (25 
SSLPs), respectively. Polymorphic SSLPs can be used as molecular markers in the 
analysis of homeologous recombination between genomes from these subspecies. 

The PCR reactions (25 uL) were carried out over 35 cycles (15 seconds at 94°C, 30 
seconds at 55°C and 30 seconds at 72°C), with 0.25 U Taq DNA polymerase and 0.6 /xg 
35 genomic DNA in reaction buffer containing 2 mM MgCl 2 - PCR products were separated 
by agarose gel electrophoresis (4% ultra high resolution agarose) and visualised by 
ethidiumbromide staining. The results from the PCR experiments are summarised in 
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Table 1. which also shows the sequence of PCR primers, primer annealing temperature 
(Tm). PCR product length and chromosome location of SSLP (with respect to the RI map 
of May 1997, Weeds World 4i). 
F Production o f hvhrid plants 
5 C24 plants heterozygous for chlorina3A:Ac/GUS are crossed as male to emasculated 
wildtype Ler to produce Ler/C24(chlorina3A, CUS) hybrid seeds. 

Due to the heterozygosity of the C24 parent, only 50 % of hybrid plants have 
inherited the chlorina3A:Ac/GUS locus. The remaining 50% of hybrid plants are wildtype 
with respect to chlorina3A:Ac/GVS. Since the mutant locus is linked to a kanamycin 
io resistance gene (contained on the same T-DNA as GUS) mutant plants can be pre-selected 
by germinating hybrid seeds on germination medium containing 50 mg/L kanamycin. 

Ler plants homozygous for the five chromosome markers are male sterile (msl-1) 
and are crossed without emasculation to wildtype C24 to produce Ler(an-I. py-l. gll-1. 
cer2-l. msl-l)IC24 hybrid seeds. 
15 Other Ler plants, which are male fertile, are crossed after emasculation of the female 
parent to produce LeriC24 hybrid seeds. 

r. introduction of MSfK and MSHM* antisense r e n e s i nto Arabi dopsis and analysis of 

meiotic homeologou s recombination 

Transformation nf hvhrid pl a nt, and analysis of homeologous meiotic recombination 
20 The plant transformation vectors comprising the antisense genes described in (B) and 

(C) above are introduced into Agrobacterium tumefaciens strain AGLX (Lazo et a!.. 1991. 

Bio/Technology 9, 963-967) by electroporation. Recombinant Agrobacterium clones are 

selected on LB medium containing 50 mg/L rifampicin and 100 mg/L carbenicillin. 

Selected clones are used to infect roots of Arabidopsis hybrid plants (described in (F) 
25 above) using the root transformation protocol of Valvekens et al. (1988. PNAS 85. 5536- 

5540) except that shoot and root inducing media contain hygromycin (10 mg/L) instead of 

kanamycin. 

Plants regenerated from roots of hybrid plants are genetic clones of root donating 
plants and therefore are again genetic hybrids of two Arabidopsis subspecies described in 
30 (F). However, in contrast to the root donating plants, the regenerated hybrid plants also 
contain the introduced transgene and the co-introduced hygromycin resistance gene with 
the latter allowing these plants to grow on hygromycin containing culture medium. 

Hygromycin resistant plants are then allowed to enter the reproductive phase and to 
produce gametes by meiotic divisions of microspore and megasppre mothercells. At 
35 meiosis, the DMC\ promoter is activated and can direct the expression of antisense genes 
described in (B) and (C) above, leading to decreased MSH6 and/or MSHi gene 
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expression. This in turn depletes the gamete mothercells of MSH6 and/or MSH3 protein, 
thus causing alteration of MMR during meiotic divisions and increasing the recombination 
frequency between homeologous chromosomes. 

Transgenic plants are then allowed to self-fertilise and to produce seeds. These 
5 seeds (F2 seeds with respect to hybrid production), and the plants derived therefrom, 
carry the homeologous recombination events which can be identified by using the visual 
and molecular recombination markers described in (E) above. 

In case of homeologous recombination between chromosomes of Ler and 
C24(chlorina3A:Ac. GUS). the analysis concentrates on chromosome 2 by selecting plants 
,o showing the visual phenotypic marker chlorine. This marker thus serves as a reference 
point as it indicates that respective chromosomes 2 originate from C24. Other markers, 
such as CUS or molecular markers, on chromosome 2 may then be used to identify 
chromosomal regions which are derived from the Ler chromosome as a result of 
homeologous recombination. F2 plants of control transformants not expressing the 
15 antisense" gene(s) can be analysed in parallel and the results can be used for comparison to 
homeologous recombination results obtained in antisense plants. 

«\\ Transformation of C7<\ wildtvpe hybrid plant production and analysis of homeologous 
meiotic recombination 

Introduction of MMR altering genes into wildtype C24 is done using the root 
2 o transformation protocol as described in G(i) for transformation of hybrid plants. 
Transformed plants are selected by resistance to either 10 mg/L hygromycin (in case of 
transformation with T-DNA's derived from pNOS-Hyg-SCV) or to 50 mg/L kanamycin 
(in case of transformation with T-DNA's derived from pBIN19). 

Transgenic plants are then allowed to self-fertilise and to produce seeds (Tl seeds). 
25 Segregation of the antibiotic resistance gene in the Tl population then indicates the 
number of transgene loci. Lines with a single transgene locus (indicated by a 3:1 ratio of 
resistant:sensitive plants) are selected and are bred to homozygosity. This is done by 
collecting selfed seeds (T2) from Tl plants and subsequent testing of at least four 
independent T2 seed populations for segregation of the antibiotic resistance gene. T2 
30 populations which do not segregate the antibiotic resistance gene are assumed to be 
homozygous for both the resistance gene and the linked MMR altering gene. 

C24 plants homozygous for the MMR altering gene are then crossed to Ler lines 
homozygous for recessive visual markers (see E(ii)) to produce C24/Ler hybrid plants as 
described in (F). These Fl hybrids are then allowed to enter the reproductive phase and to 
as produce gametes by meiotic division of microspore and megaspore mothercells. At 
meiosis. the DMC 1 promoter is activated and can direct the expression of antisense or 
sense genes described in (B), (C) and (D) above, leading to decreased MSH6 and/or MSH3 
gene expression. This in turn depletes the gamete mothercells of MSH6 and/or MSH3 
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protein, thus causing alteration of MMR during meiotic divisions and increasing the 
recombination frequency between the homeoiogous chromosomes of C24 and Ler. 
Recombination events are then scored in the F2 generation. 

For recombination analysis, the hybrid plants are allowed to self-fertilise and to 
5 produce F2 seeds. F2 plants are then preselected for a first visual marker. Since this 
marker is recessive, its visual presence indicates homozygosity for Ler DNA at the 
relevant locus. Those F2 plants which show this First visual marker are then analysed for 
the presence or absence of a second visual marker which in the Ler parent is closely linked 
to the first marker. Absence of the second visual marker indicates recombination between 
io the relevant C24 and Ler chromosomes between the first and second marker. The 
frequency of recombination in transgenic hybrids is compared to the recombination 
frequency in control hybrids not expressing the MMR altering gene. 

Examples of recombination analysis are the following. 

The Ler line NW22(d/5/. ga4 . thl) is used for crosses to transformed C24. 

is F2 plants are preselected first for thiamine requirement (thl) and then are further 

analysed for re-appearance of wildtype height (loss of ga4) and/or re-appearance of 
normal trichomes (loss of disl) as a result of recombination. 

The Ler line NWlOUz-207, cer3 ) is used for crosses to transformedC24. 

F2 plants are then preselected first for thiazole requirement (tz) and then are further 
20 analysed for re-appearance of normal, i.e. non-shiny stems (loss of cer3) as a result of 
recombination. 

The Ler line NW134 (ng , ga3 ) is used for crosses to transformed C24. F2 plants 
are first preselected for dwarfish appearance (ga3) and are then analysed for re-appearance 
of trichomes (loss of ng) as a result of recombination. 

25 Ler plants homozygous for abi3-\ and gll-\ are used for crosses to transformedC24. 
F2 plants are first preselected for their ability to germinate in the presence of abscisic acid 
and are then analysed for re-appearance of trichomes on the leaves (loss of gll-l) as a 
result of recombination. 

In the case of homeoiogous recombination between transformed C24 and the Ler line 
30 NW1 (an-L py-1* gll-l- cer2-\, msl-l) y recombination analysis is similar the one 
described above, except that the second marker is not a visual marker but has to be a 
molecular marker. This is because the Ler parent carries only one visual marker per 
chromosome. 
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CLAIMS 

1. An isolated and purified DNA molecule comprising a polynucleotide sequence 
encoding a polypeptide functionally involved in the DNA mismatch repair system of a 
plant. 

5 2. A DNA molecule according to claim 1 wherein said polypeptide is 
homologous to a mismatch repair polypeptide of a yeast or of a human. 

3. A DNA molecule according to claim 1 wherein said polypeptide is 
homologous to AtMSH3 (SEQ ID NO: 19) or to AtMSH6 (SEQ ID NO: 31). 

4. An isolated and purified polypeptide functionally involved in the DNA 
io mismatch repair system of a plant. 

5. A polypeptide according to claim 4 which is homologous to a mismatch repair 
polypeptide of a yeast or of a human. 

6. An isolated and purified polypeptide selected from the group consisting of a 
polypeptide encoded by the gene AtMSH3 (SEQ ID NO: 18), a polypeptide encoded by the 

is gene AtMSH6 (SEQ ID NO:30), polypeptides homologous to a polypeptide encoded by the 
gene AtMSH3 (SEQ ID NO: 18) and polypeptides homologous to a polypeptide encoded 
by the gene AtMSH6 (SEQ ID NO:30). 

7. An isolated and purified DNA molecule comprising a polynucleotide sequence 
selected from the group consisting of (i) a sequence encoding a polynucleotide which is 

20 capable of interfering with the expression of a plant polynucleotide sequence encoding a 
polypeptide which is homologous to a mismatch repair polypeptide of a yeast or of a 
human and thereby disabling said plant polynucleotide sequence; and (ii) a sequence 
encoding a polypeptide capable of disrupting the DNA mismatch repair system of a plant. 

8. A DNA molecule according to claim 7 comprising a polynucleotide sequence 
25 encoding a polynucleotide capable of interfering with the expression of a plant 

polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human and thereby disabling said plant polynucleotide 
sequence. 

9. A DNA molecule according to claim 8 wherein said polynucleotide is capable 
30 of interfering with the expression of a plant polynucleotide sequence is a sense 

polynucleotide, an antisense polynucleotide or a ribozyme. 

10. A DNA molecule according to claim 7 comprising a polynucleotide sequence 
encoding a polypeptide capable of disrupting the DNA mismatch repair system of a plant. 
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U. A DNA molecule according to claim 10 wherein said polypeptide is 
homologous to AtMSH3 (SEQ ID NO: 19) or to AtMSH6 (SEQ ID NO: 31). 

12. A DNA molecule according to claim 10 further comprising a regulation 
element capable of causing overexpression of said polypeptide in a cell of said plant. 

5 13. A chimeric gene comprising: 

a DNA sequence selected from the group consisting of (i) a sequence encoding a 
polynucleotide capable of interfering with the expression of a plant polynucleotide 
sequence encoding a polypeptide which is homologous to a mismatch repair polypeptide of 
a yeast or of a human and thereby disabling said plant polynucleotide sequence, and (ii) a 

io sequence encoding a polypeptide capable of disrupting the DNA mismatch repair system 
of a plant; and 

at least one regulation element capable of functioning in a plant cell. 
14. A chimeric gene according to claim 13 wherein said regulation element is 
selected from constitutive, inducible, tissue type specific and cell type specific promoters. 
, 5 15. A chimeric gene according to claim 13 comprising a DNA sequence encoding 
a polypeptide capable of disrupting the DNA mismatch repair system of a plant, wherein 
said regulation element is capable of causing overexpression of said polypeptide in a cell 
of said plant. 

16. A chimeric gene according to claim 13 wherein said regulation element is 
20 selected from the group consisting of 35S, NOS. PRla. AoPRl and DMC1. 

17. A plasmid or vector comprising a chimeric gene according to any one of 
claims 13-16. 

18. A plant cell stably transformed, transfected or electroporated with a plasmid or 
vector according to claim 17. 

25 19. A plant comprising a cell according to claim 18. 

20. A plant according to claim 19 selected from plants of the families 
Brassicaceae, Poaceae, Solanaceae, Asteraceae, Malvaceae, Fabaceae. Linaceae, 
Canabinaceae, Dauaceae and Cucurbitaceae. 

21 . A process for at least partially inactivating a DNA mismatch repair system of a 
30 plant cell, comprising transforming or transfecting said plant cell with a DNA molecule 

according to any one of claims 1-3 or 7-12 and causing said DNA sequence to express 
said polynucleotide or said polypeptide. 

22. A process for at least partially inactivating a DNA mismatch repair system of a 
plant cell, comprising transforming or transfecting said plant cell with a chimeric gene 
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according to any one of claims 13-16 and causing said DNA sequence to express said 
polynucleotide or said polypeptide. 

23. A process for at least partially inactivating a DNA mismatch repair system of a 
plant cell, comprising transforming or transfecting said plant cell with a plasmid or vector 

s according to claim 17 and causing said DNA sequence to express said polynucleotide or 
said polypeptide. 

24. A process for increasing genetic variation in a plant comprising obtaining a 
hybrid plant from a first plant and a second plant, or cells thereof, said first and second 
plants being genetically different; altering the mismatch repair system in said hybrid plant; 

10 permitting said hybrid plant to self-fertilise and produce offspring plants; and screening 
said offspring plants for plants in which homeologous recombination has occurred. 

25. A process according to claim 24 wherein a first gene is incapacitated in said 
first plant, a second gene is incapacitated in said second plant, and said first and second 
genes are incapacitated in said hybrid plant thereby altering the mismatch repair system of 

is said hybrid plant. 

25. A process according to claim 25 wherein said incapacitation of the mismatch 
repair system of said hybrid plant is reversible. 

26. A process according to claim 24 wherein a new genetic linkage of a desired 
characteristic trait or of a gene which contributes to a desired characteristic trait is 

20 observable in at least one of said offspring plants. 

27. A process for obtaining a plant having a desired characteristic, comprising 
altering the mismatch repair system in a plant, cell or plurality of cells of a plant which 
does not have said desired characteristic, permitting mutations to persist in said cells to 
produce mutated plant cells, deriving plants from said mutated plant cells, and screening 

25 said plants for a plant having said desired characteristic. 

28. A process according to claim 27 wherein said step of altering the mismatch 
repair system comprises introducing into said hybrid plant, plant, cell or cells a chimeric 
gene according to claim 13 and permitting the chimeric gene to express a polynucleotide 
which is capable of interfering with the expression of a plant polynucleotide sequence in a 

30 mismatch repair gene of the hybrid plant, plant, cell or cells, or a polypeptide capable of 
disrupting the DNA mismatch repair system of the hybrid plant, cell or cells. 

29. A process according to claim 28 comprising inactivating an MSH3 gene and/or 
an MSH6 gene of said plant. 

30. A process according to claim 28 comprising inactivating an MSH3 gene and an 
35 MSH6 gene of said plant. 
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31. A process according to claim 27 comprising at least partially inactivating the 
mismatch repair system of said plant in a predetermined cell type or in a predetermined 
tissue of said plant. 

32. A process according to claim 31 further comprising restoring mismatch repair 
5 in said cell type or said tissue. 

33. An oligonucleotide capable of hybridising at 45°C under standard PCR 
conditions to a DNA molecule according to claim 1 with the proviso that said 
oligonucleotide is other than SEQ ID NO:l or SEQ ID NO:2. 

34. An oligonucleotide capable of hybridising at 45°C under standard PCR 
10 conditions to the DNA of SEQ ID NO: 18 with the proviso that said oligonucleotide is 

other than SEQ ID NO: 1 or SEQ ID NO:2. 

35. An oligonucleotide capable of hybridising at 45°C under standard PCR 
conditions to the DNA of SEQ ID NO:30 with the proviso that said oligonucleotide is 
other than SEQ ID NO: 1 or SEQ ID NO:2. 
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Figure 2 



Comments/References: 52= 3* side of S5 (AtMSH3) 1244bp in pUCl8/Sma1 
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Figure 3 

Comments/References: 13 = 3* side of S5 (AtMSH3) 2104bp in pUCl8/Smal 
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Figure 7 

Comments/References: 43= 5" side of S8 (AtMSH6) 2182 bp tn P UCl8/Sma1 
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Figure 8 



Comments/References: 62= 3' side of S8 (AtMSH6) 1379bp in pUCl8/Smal 
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T77777GG77GC7AACAA7AAAGG7A7ACGG777TA7G7CA7CAA7A7AA 
CTATATATAAAAGAAATGAAAGATATATATTGTTTTTTCATTTATCAAAC 
AAAACAACAAGACTTTTTTTTTACTTTTTACATTGGTCAACAAAATACAA 
GATAAACGACATCGTTTAATCATTTCCCAATTTTACCCCTAAGTTTAACA 
CCTAGAACCTTCTCCATCTTCGCAAGCACAGCCTGATTAGGAACAGCTTT 
ACCATTCTCATATTCCTGAACTACCTGAGTCCTCTCATTGATCTGTTTCG 
CCAAATCCGCTTGTGACATCTTCTTCTCCAATCTCGCTTTCTGTATCATC 
AACCTCACCTCTGCTTTCACACGATCCATCGCCGCAGGCTCTGTTTCTTC 
TTCCAGCTTCTTCGTGTTAATCACCGGAACCGCCGTAGATTTCCCCTTTT 
TGTTCGAACCGGCATCGAATTTCTTAACCGTTTGAACCGCGACACCGTTT 
CTCAGAGCTGCGTTAACCGCTTTCGGATCGCGTAGGTCTTGGCTCTTTTG 
TTTTGATTTGTGGAGAACTACTGGTTCCCAGTCTTGTGTTACTGCTCCTG 
GGTATCTGCTCGGCATCGTCGATGAATTGAGAGAAAGGAACAACGCGAAA 
ATTTTATTAATCTGAGTTTTGAAATTGAGAAACGATGAAGATGAAGAATG 
TTGTTGAGAGGATTGTGATATTTATATATACGAAGATTGGTTTCTGGAGA 
ATTCGATCATCTTTTTCTCCATTTTCG7CTC7GGAACGTTCTTAGAGATG 
ATTGACGACGTGTCATTATCTGATTTGCAGTTAACCAATGCTTTTTGGGT 
7GGATTCGTGGTACACCATATTATCCGA7TTGGCTCAATGGTTTTATATA 
AA777GG7777CGG77CGG77A7GAG77A7CA77AAAA7TAAGCTAACCA 
AAAA7777CG7AAAAT77A777CGG777CAA77CGGA7CCC77ACT7CCA 
GAACCGAATTA77CGAAACCGGGG77AGCCGAACCGAATACCAATGCC7G 
A77GAC7CG77GGC7AGAAAGA7CCAACGG7A7ACAATAA7AGAACA7AA 
A7CGGACGG7CA7CAAAGCC7CAAAGAG7GAACAG7CAACAAAAAAAG77 
GAGCCC7GAGGAG7A7CG777CCGCCA777C7ACGACGCAAGGCGAAAA7 
T777GGCGCCAA7C777CCCCCC777CGAA77C7C7CAGC7CAAAACA7C 
G777C7C7C7CAC7C7C7C7CACAA77CCAAAAAA7GCAGCGCCAGAGA7 
CGA7777G7C777C77CCAAAAACCCACGGCGGCGAC7ACGAAGGG777G 
G777CCGGCGA7GC7GC7AGCGGCGGGGGCGGCAGCGGAGACCACGAT77 
AA7G7GAAGGAAGGGGA7GC7AAAGGCGACGCT7C7G7ACG7777GC7G7 
7TCGAAATC7G7CGATGAGG77AGAGGAACGGA7AC7CCACCGGAGAAGG 
77CCGCGTCG7G7CC7GCCGTC7GGA7TTAAGCCGGC7GAATCCGCCGG7 
GA7GC7TCG7CCC7G77C7CCAA7A7TATGCA7AAG777G7AAAAGTCGA 
7GATCGAGA77G77CTGGAGAGAGG7AC7AATCT7CGA77C7CTTAATT7 
7G77A7C7T7AGC7GGAAGAAGAAGA7TCGTG7AA777G7TG7A7TCGTT 
GGAGAGATTC7GA77AC7GCA7TGGATCGTTGTTTACAAA7777CAGGAG 
CCGAGAAGA7G77G77CCGCTGAA7GATTCATCTC7A7GTATGAAGGCTA 
A7GA7G77A77CC7CAA777CG77CCAA7AATGG7AAAAC7CAAGAAAGA 
AACCA7GC7777AG777CAG7GGGAGAGC7GAAC77AGA7CAG7AGAAGA 
7A7AGGAG7AGA7GGCGA7GT7CC7GG7CCAGAAACACCAGGGATGCG7C 
CACG7GCT7C7CGC77GAAGCGAG77C7GGAGGATGAAA7GACTTTTAAG 
GAGGA7AAGG77CC7G7A77GGACTCTAACAAAAGGC7GAAAA7GCTCCA 
GGA7CCGG77TG7GGAGAGAAGAAAGAAG7AAACGAAGGAACCAAATTTG 
AATGGG7TGAG7C77C7CGAATCAGGGATGCCAATAGAAGACGTCCTGAT 
GA7CCCC777ACGA7AGAAAGACC77ACACA7ACCACCTGA7G7TTTCAA 
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GAAAA7G7C7GCA7CACAAAAGCAA7A77GGAC7G77AAGAG7GAA7A7A 

TGGACATTGTGCTTTTCTTTAAAGTGGTTAGTAACTATTAATCTAGTGTT 

CAATCCATTTCCTCAATGTGATTTGTTCACTTACATCTGTTTACGTTATG 

CTCTTCTCAGGGGAAATTTTATGAGCTGTATGAGCTAGATGCGGAATTAG 

GTCACAAGGAGCTTGACTGGAAGATGACCATGAGTGGTGTGGGAAAATGC 

AGACAGGTAAATTAGTTGAAACAACTGGCCTGCTTGAATTATTGTGTCTA 

TAAATTTTGACACCACCTTTTGTTTCAGGTTGGTATCTCTGAAAGTGGGA 

TAGATGAGGCAGTGCAAAAGCTATTAGCTCGTGGGTAAGGGAACCATCAT 

ACTTTATGGAATTCGTTTACTGCTACTTCGGCTAGGATTTAAGAAATGGA 

AATCACTTCAAGCATCATTAGTTAGGATCCTGAGAACTCAGGATGTTTTC 

TTATTCGTTATATAATAAGTCTTTTCATCAAGGAGTAACAAACAAAACTT 

GCACAATATTTGTGTGCTCACTGGCAAGGCATATATACCCAGCTAACCTT 

TGCTAGTTCACTGTAGTAACAGTTACGGATAATATATGTTTACT7GTATG 

TGG7ACCC7CA7777G7C7C7CA7GGAGGC777CAAGCC77G7G77GAAA 

CTGGATAGTTACATATGC7TCCAACAGAAACTAGCATGCAGATTCA7ATG 

CTTTCCTAT7C7ACTAA77A7G7A77GACACACTCG7TG7TTCT7TTGAA 

AGA7A7AAAG77GGACGAA7CGAGCAGC7AGAAACA7CTGACCAAGCAAA 

AGCCAGAGG7GC7AA7ACTG7AAG7777C7TGGA7AGGTCAAGGAGAG7G 

77GCAGAC7G77777GA7CA777C7777777G7ACA77AC777CA7GC7G 

TAATTAACTCAA7GGC7AT7CTGG7C73ATTATCAGATAATTCCAAGGAA 

GCTAG77CAGG7A77AACTCCA7CAACAGCAAGCGAGGGAAACA7CGGGC 

CTGATGCCG7CCATCTTCTTGC7A7AAAAGAGGTT7GTTATTTACTTATT 

7A7C77A7CA7G77CAG77CA7CCAAG7CC7GAAAAA77ACAC7G77C77 

TACCAA7C77CCA7CAAGC7G7G7AAAGGA777GGAA77AGAAAA7CA77 

A777GA7GC777G7777A7A7GCAAGAGG77CCC7TGAAAAGA7C7G777 

AAGA77C777GCAC77GAAAAA77CAA7C77777AAG7GAA7CGCC7AC7 

TTCTTACAATGA7CATAGTCTGCAA77GCATGTCAAGTAATA7CA77CCT 

TG77AC7GCATCCCCC7C777C77AA7GAQCA77G7C7A7G77G7G777G 

7C7CG7G7GC7GGAGAAAA7GA7AGC7GA7CCAAGC7G7ACA77A7CA7G 

A7TAAG7AGC7GC7CAGGAA77GCC777GG77ACA77GCC7AA7GG777G 

A7G7CAA77777C77CTGAA7C777A7777AGA7CAAAA7GGAGC7ACAA 

AAG7G77CAAC7G7G7A7GGA777GC7777G77GAC7G7GC7GCC77GAG 

G7777GGG77GGG7CCA7CAGCGA7GA7GCA7CA7G7GC7GC7C77GGAG 

CG77A77GA7GCAGG7AAGCAAG7G7A77C7G7A7C77A7G7G7ACCA7G 

7GAC77CC7G7GCA7A7A777GGG77GCAGGAAC7AA77C7GAA7CACCA 

77TGG7A7G777777CCAGG777C7CCAAAGGAAG7G77A7A7GACAG7A 

AAGG7AAAC7GC77G7A7CGCCAG77G7777G77AAACAGAA777AAGG7 

AAA7 GACAC7 GG7TAA777 AAAG7GCA7 ACA7G7 7 GAAA7 A7 7G C AGGGC 
7A7CAAGAGAAGCACAAAAGGC7C7AAGGAAA7A7ACG77GACAGG7ACC 
A777CAG7AGGCAAGC7AAC7GACAA777AACCGC7CACCGAA7GA7AGG 
7C7C77AAACA77GC7AA7G7AGA7GA7G777A7G777CAA7C7AA7AGG 
G7C7ACGGCGG7ACAG77GGC7CCAG7ACCACAAG7AATGGGGGA7ACAG 
A7GC7GC7GGAG77AGAAA7ATAA7AGAA7C7AACGGA7AC777AAAGG7 
7C77C7GAA7CA7GGAAC7G7GC7G77GA7GG7C7AAATGAA7G7GA7G7 

Figure 11 (Continued) 
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TGCCC77AG7GC7C77GGAGAGC7AA77AATCA7C7G7C7AGGC7AAAGG 
TGTGTTGGCTTGTTTAGTTTTTGCTTTTCACAAATTAAGCAAAGGAACTT 
TTCATAACTTACAGTTTCTATCTACTTGCAGCTAGAAGATGTACTTAAGC 
ATGGGGATATTTTTCCATACCAAGTTTACAGGGGTTGTCTCAGAAT7GAT 
GGCCAGACGATGGTAAATCTTGAGATATTTAACAATAGC7GTGATGGTGG 
TCCTTCAGGCAAGTGCATATTTCTTTTTTGATAACTTCAACTAGAGGGCA 
GACATAGAAGGAAAAATTCTAATACTTCGTACGGATCTCCAGTAAGTAAT 
AGCCGATTTTTGTTTACCTATGTAGGGACCTTGTACAAATATCTTGATAA 
CTGTGTTAGTCCAACTGGTAAGCGACTCTTAAGGAATTGGATCTGCCATC 
CACTCAAAGATGTAGAAAGCATCAATAAACGGCTTGATGTAGTTGAAGAA 
TTCACGGCAAACTCAGAAAGTATGCAAATCACTGGCCAGTATCTCCACAA 
ACTTCCAGACTTAGAAAGACTGCTCGGACGCATCAAGTCTAGCGTTCGAT 
CATCAGCCTCTGTGTTGCCTGCTCTTCTGGGGAAAAAAGTGCTGAAACAA 
CGAGTAAGTATCAATCACAAGTTTTCTGAGTAATGCCTTCCATGAGTAGT 
ATAGGACTAAAACATTACGGGTCTAGCTAAAGACTGTTCTCCTTCT7TTG • 
CAATG7CTGGTTATTCATTACATTTCTCTTAACTTATTGCATTGCAGGTT 
AAAGCA77TGGGCAAA77G7GAAAGGG77CAGAAGTGGAA77GA7C7G7T 
G77GGC7C7ACAGAAGGAA7CAAATA7GA7GAG777GC777A7AAAC7CT 
G7AAAC7TCC7ATATTAGTAGGAAAAAGCGGGCTAGAG77ATTTC77TCT 
CAA77CGAAGCAGCCA7AGA7AGCGAC777CCAAA77A7CAGG7GCCCA7 
C7A7CT77CA7AC777ACAACAAAA7G7C7G7CAC7AC7CAAAGCAA7GC 
A7A7GGC77AGA7CTCAAC7CACACCCCGAGGA7CC7AAAGGGA7T7GC7 
7777A77CC7AA7G77777GGA7GG777GA777A777C7AAC77GAAC7T 
AT7AA7C77G7ACCAGAACCAAGA7G7GACAGA7GAAAACGC7GAAAC7C 
TCACAA7ACT7A7CGAAC77777A7CGAAAGAGCAAC7CAA7GG7C7GAG 
GTCA77CACACCATAAGC7GCC7AGA7G7CC7GAGA7C7777GCAA7CGC 
AGCAAG7C7C7CTGC7GGAAGCA7GGCCAGGCC7G77A77777CCCGAA7 
CAGAAGC7ACAGATCAGAA7CAGAAAACAAAAGGGCCAATAC77AAAA7C 
CAAGGAC7ATGGCA7CCA777GCAG77GCAGCCGA7GG7CAA77GCC7G7 
7CCGAATGATATAC7CCTTGGCGAGGC7AGAAGAAGCAGTGGCAGCA7TC 
ATCC7CGGTCATTG77AC7GACGGGACCAAACA7GGGCGGAAAA7CAACT 
C77C7TCGTGCAACATGTCTGGCCG77A7CT7TGCCCAAGT77G7A7ACT 
CGTTAGATAATTACTCTATTCT7TGCAATCAG7TC77CAACA7GAATAAT 
AAA7TC7GTT7TCTG7CTGCAGC77GGC7GCTACG7GCCG7GTGAGTCTT 
GCGAAATCTCCCTCG7GGATACTATCTTCACAAGGC77GGCGCA7C7GAT 
AGAATCATGACAGGAGAGAGTAAG7T77G77CTCAAAATACCAATTCCTC 
GAAC7A77TACTCAGATTTTGTC7GA77GGACAAGGTGG77T7GC7T7TT 
7T7AGG7ACC7TTT7GG7AGAA7GCAC7GAGACAGCG7CAGT7CTTCAGA 
A7GCAAC7CAGGAT7CAC7AG7AA7CC77GACGAAC7GGGCAGAGGAAC7 
AGTACTTTCGATGGA7ACGCCAT7GCA7ACTCGGTAACCTGCTCTTCTCC 
7TCAAC7TATACTTGTTGATCAACAAAAACATGCAA77CA7T7TGCTGAA 
ACTTATTGATTTATATCAGGTTTTTCGTCACCTGG7AGAGAAAGTTCAAT 
GTCGGATGCTCTTTGCAACACATTACCACCCTCTCACCAAGGAATTCGCG 
7C7CACCCACG7GTCACC7CGAAACACA7GGCTTGCGCA7TCAAATCAAG 

Figure 11 (Continued) 
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A7C7GA77A7CAACCACG7GG77G7GA7CAAGACC7 AG7G77C77G7ACC 6650 

GT7TAACCGAGGGAGCTTGTCCTGAGAGCTACGGACTTCAAGTGGCACTC 6700 

ATGGCTGGAATACCAAACCAAGTGG7TGAAAC AGCATCAGGTGCTGCTCA 6750 

AGCCATGAAGAGATCAATTGGGGAAAACTTCAAGTCAAGTGAGCTAAGAT 6900 

CTGAGTTCTCAAGTCTGCATGAAGACTGGCTCAAGTCATTGGTGGGTATT 6850 

TCTCGAGTCGCCCACAACAATGCCCCCATTGGCGAAGATGACTACGACAC 6 900 

TTTGTTTTGCTTATGGCATGAGATCAAATCCTCTTACTGTGTTCCCAAAT 6950 

AAATGGCTATGACATAACACTATCTGAAGCTCGTTAAGTCTTTTGCTTCT 7 00 0 

CTGATGTTTATTCCTCTTAAAAAATGCTTATATATCAAAAAATTGTTTCC 7 050 

TCGATTATAACAAGATTATATATGTATCTGTCGGTTTAGCTATGGTATAT 7100 

AATAT ATGT ATGTTCATGAGATTGGTCAAGAGAAATACTCACAAACAGTA 7 1 5 0 

7A77AAGAAGGAAA7A7G777A7GCA7TAAT77AAG777CAAGATAAACT 7200 

GCAAATAACCTCGACTAAAGTTGCAAAGACCAAACACAAATTACAAAACT 7 250 

TATAAGACTTAAGTTCTGAATTCCCTAAAACCAAAAAAAAAAACAGAACA 7 300 

TATTTTGTTGCATCTACAAACAACACAAACCTACATAGTTTATAACTTAC 7 350 

TCATCACTGAGATTAACATCAGAATCATTCTCCATTTCTTCATCTTCACT 7 400 

CTCATCATCATCACCACCACCATGATGATTCTCCTCCTCTTCACGT AACC 7 4 5 0 

TAGCAATCTCACTCTGAGCTCTATCAACAATCTGCTTCTTCTGCAACTCC 7 500 

AAATCTCTCTGAAAATCAGCTCTCATCTTCTCCAACTCCTTCATTTGCTC 7 550 

TTTCTTACTCTTCTCCATCTTCTCATAAACCTTCCCAAACCTCTCAACAG 7 600 

AATCCGCCAACATCTTATACGAAGCAGCGTCATTAACCTTCTTCCTCTCG 7 650 

TACTCAACCTCATCATCCTCATCCTCCTCCTCTTCAGAATCACCAGGACT 7 7 00 

ATCCATCATCTCATCAAACCCATTAGACTTATCTAAATAAACCTTAGTGT 77 50 

TCATAAACACAAACTCACCTGAATCAAC ACCACAAGC7AAACCTAAATCC 7 3 0 0 

GACTTGGGCGAAACACAAAGCAACATATCCAACTTATTGAAAAACGACCA 7 850 

TTTACTTGAACCTAAACCTGATTTC7CAACCTTAATCTTCTCTTTTCTAT 7 900 

AC77CC7C77CAAG7CA7CAA7CA77C7CCTACA7TGCG7C7CAGA777C 7 950 

7CCA7CC77AGC7CC7CAC7CAC777C7CAGC7ACTTCA77CCAA7CC7C 8000 

G77CCTCAAACTCC77C7ACCCAA77GCAAAAACCTA7C7CCCCAAAC77 8050 

CAAGCAACACAA 8062 

Figure 11 (Continued) 
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Figure 12 



Comments/References: AtMSH3 3' side antisense : AtMSH3 3' (13 = 2104bp) from 
pUCta/13 Sal1/Sst1/T4 into pCW164 BamH1/T4 in Agrobacterium LBA4404 
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Figure 13 



Comments/References: AtMSH6 (S8) 3' side antisens : 62 Sal1/Sst1/T4 (1379bp) 
into pCWl64 BamH1/T4 



WSDOCID: <WO 9919492A2 I > 



SUBSTITUTE SHEET (RULE 26) 



WO 99/19492 



23 / 28 



PCT/EP98/06977 




8210 EcoRl.Sacl.Kpnl 



Figure 14 



Comments/References: A1MSH6 37AtMSH3 3' antisenso : AtMSH6 (S8) 3* side (62=1379bp) 
Sal1/Sst1/T4 into pPF13 (pCWl64 AtMSH3 (S5) 3' side (13=2104) antisens)/Sma1. in 
LBA4404 
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8210 EcoRI.Sad. Kpn1 



Figure 15 



Comments/References: AtMSH3 37AtMSH6 3' antisens (D) : AtMSH3 (S5) 3' side 
(13=2104bp) Sal1/Sst1/T4 into pPFU (AtMSH6 (SB) 3'side (62=l379bp) antisense into 
pCW164)/Smal. in LBA4404 



ysiSDOClD: <WO 9919492A2. 1 > 



SUBSTITUTE SHEET (RULE 26) 



WO 99/19492 



25 / 28 



PCT/EP98/06977 




8210 EcoRl.Sad. Kpn1 



Figure 16 



Comments/References: AtMSH3 (S8) complete, sense orientation : pPF26 (3342bp) 
Sma1 into pCW164 Sma1 
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Figure 17 



Comm«ntt/Atl«r«ne«»: pPZPIH with codA EooRl cassette tn EcoRl site and Hlnd3 
GUS cassette in Hin<J3 site, Kan A. A(l genes under PrwnoterAerminator 35 S 
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Figure 18 



partial cONA MSH 6 




Kpn I | Sea I 
Eco Rl 



Xmn I 
BamHI 



cDNA 
clon 62 



Sal I 



Pstl 
Sph I 
Hind III 



partial cONA MSH 3 




Kpn I Hind ill 




BamHI 



Kpni Ava I Sma I Bgl II Nco I Kpnl Sad 



DMC1 promoter 1 .8 kb 



EcoRI 



p2031 



t-NOS 
0.25 kb 



cDNA 
clon 13 



.||_ p2033 



EcoRI 



Amp 

ColE1 
ori 



Eco RV Pst I 




Bam HI 



EcoRI 



MCS 



LB pNOS 
0.35 kb 



Hyg" 

1.1 kb 



t-NOS 
0.25 kb 



RB over 
drive 



pNos-Hyg-SCV 10.2 kb 



Gm/Km 



RK2 
ori 



p3242 



iSDOClO: <WO_9919492A2_L> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/1 9492 



28 / 28 



PCT/EP98/06977 



p3243 



Figure 19 
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Rhone-Poulenc Agro; Beczner, Andreas Stefan; Doucriaux, 
Marie-Pascale; Freyssinet. Georges; Perez, Pascual . 



<120> Methods for obtaining plant varieties 

<130> 395498C 

<150> P09745 

<151> 1997-10-10 

<160> 98 

<210> 1 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<221> modif ied_base 

<222> 11 

<223> I 

<220> 

<221> modif ied__base 

<222> 14 

<223> I 

<220> 

<221> modif ied — base 

<222> 17 

<223> I 

<220> 

<223> Degenerate oligonucleotides UPMU used to isolate AtMSH3 and 
AtMSHG . 

<300> 

<3 0l> Reenan and Kolodner 

<302> Genetics 

<303> 132 

<306> 963-973 

<307> 1992 

<400> 1 

ctggatccac nggnccnaay atg 23 

<210> 2 

<211> 23 

<212> DNA 
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2 . 

<213> Arcificial sequence 
<220> 

< 2 2 1 > modi f ied_base 

<222> 15 

<223> I 

<220> 

<221> modif ied_base 

<222> 18 

<223> I 

<220> 

<223> Degenerate oligonucleotides DOMU used to isolate AtMSH3 and 
AtMSH6 . 

<300> 

<3 01> Reenan and Kolodner 

<302> Genetics 

<303> 132 

<306> . 963-973 

<3.07> 1992 

<400> 2 

ctggatccrt artgngtnrc raa 23 



<210> 3 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer 636 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 3 



tgctagtgcc tcttgcaagc teat 



<210> 4 
<211> 27 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer API for PCR using cDNA of Arabidopsis thaliana ecotype 

Columbia containing adapter sequences ligated to both its 
ends 

<400> 4 
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27 



<210> 5 
<211> 23 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer AP2 for PCR using cDNA of Arabidopsis thaliana ecotype 

Columbia containing adapter sequences ligated to both its 
ends 

<400> 5 

actcactata gggctcgagc ggc 23 



<210> 6 
<211> 30 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> MSH3 specific primer S525 for PCR using cDNA of Arabidopsis 

thaliana ecocype Columbia 

<400> 6 

aggttctgat tacgtgtgac gctttactta 30 



<210> 7 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S51 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 7 

ggatcgggta ctgggttttg agtgtgagg 29 



<210> 8 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer 635 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 
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<400> 8 

gcacgtgctt gatggtgtct tcac 24 

<210> 9 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> • MSH3 specific primer S523 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 9 

tcagacagta tccagcatgg cagaagta 28 



<210> 10 

<211> 33 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer 1S5 for PCR using cDNA of Arabidopsis 
thaliana ecocype Columbia 

<400> 10 

atcccgggac gggcaagcaa aagcagcaga cga 3 3 

<210> 11 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S53 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 11 



gacaaagagc gaaatgaggc cccttgg 



<210> 12 

<211> 1250 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 

<223> Clone 52 
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<400> 12 

cccgggacgg gcaagcaaaa gcagcagacg acccctcgtt Ccttcgctcc caaacccaaa 60 
tccccgactc acgaaccgaa tccggcagcc gaatcatcaa caccgccacc gaagacatcc 120 
gccaccgtat ccctctctcc ccccaagcgc aagcccccct ccgaccacct cgccgccgcg 180 
tcacccaaaa agcctaaact ctctcctcac acccaaaacc cagcacccga ccccaattta 240 
caccaaagat ttctccagag atttccggaa ccctcgccgg aggaatacgt ccccgaaacg 300 
ccaccatcga ggaaatacac accaccggaa cagcaagcgg tggagctaaa gagcaagtac 3 60 

ccagatgtgg ttxcgatggt ggaagttggt tacaggtaca gattcttcgg agaagacgcg 420 
gagaccgcag cacgcgtgtc gggtatttac gcccacacgg accacaattt catgacggcg 480 
agcgcgccaa cattccgatt gaatttccat gtgagaagac tggtgaacgc aggacacaag 540 
accggtgcag cgaagcagac cgaaaccgca gccactaagc cccacggcgc aaaccggacc 600 
ggcccttttt cccggggacc gccggcgccg tacaccaaag ccacgcctga agcggccgag 660 
gatacaagcg gcgg^tgcgg cggtgaagaa ggttctggcc cacagagtaa tttcttggtt 720 
tgcgttgtgg atgagagagt taagtcggag acattaggcc gcggcattga aacgagtttt 780 
gatgttagag ccggtgttgt tggcgctgaa atttcgacag gtgaagccgt tcacgaagag S40 
tccaacgaca acctcatgag aagcggacca gaggctgtga trctgagctt gccaccagct 900 
gagctgtcgc ccggccagcc tctttcacaa caaaccgaga agt ccttggt ggcacatgcc 960 

ggacctacct caaacgtccg agcggaacgc gccccaccgg accgcctcag caacggtaat 1020 

gcagtagacg aggccatccc actatgcgaa aaaaccagcg caggcaactt agaagatgac 10 80 

aaagaaatga agctggaggc tgctgaaaaa ggaatgtctt gcttgacagt tcacacaatt 1140 

acgaacatgc cacatctgac tgtccaagcc ctcgccctaa cgccttgcca tctcaaacag 12 00 

tctggatttg aaaggatcct tcaccaaggg gcctcacccc gcccttcgtc 1250 

<210> 13 



<220> 

<223> MSH3 specific primer 2S5 

thaliana ecotype Columbia 
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<400> 13 

accccgggcc aaaacgaaca agtcggcttt agtc 34 

<210> 14 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S52 for PCR using cDNA of Arabidopsis 
thaliana ecocype Columbia 

<400> 14 

gccacacccg accgttcaag ccctcgc 27 

<210> IS 

<211> 2110 

<2I2> DNA 

<213> Arabidcpsis zhaliana ecotype Columbia 

<223> Clone 13 

<400> 15 

gccacatctg actgttcaag ccctcgccct aacgtttcgc catcccaaac agtttggatt 60 

cgaaaggatc ccccaccaag gggccrcatt tcgctctttg ccaagtaaca cagagatgac 120 

ccucccagcc aacactccgc aacagccgga ggtcgngaaa aacaactcag atggaccgga 180 

acccggcccc tcattccaca atacgaacca cacacccaca gcacatggtc ccaggcttcc 240 

cagacaccgg gtgactcacc ctctatgcga tagaaacttg atacctgctc ggcttgatgc 3 00 

cgcccccgag atttccgcct gcacgggacc tcatagttct ccccagccca gcagcgagtt . 360 

ggttgaagaa ggttccgaga gagcaattgt atcacctgag ttttacctcg tgctctcctc 4 20 

agtcctgaca gctatgtcta gatcatccga cattcaacgc ggaataacaa gaatctttca 480 

ccggactgct aaagccacag agttcatcgc agttatggaa gctattttac ttgcggggaa 540 

gcaaactcag cggcccggca taaagcaaga ctctgaaacg aggagtatgc aatctgcaac 6 00 

tgcgcgatct actcttctga gaaaattgat ttctgttatt ccatcccctg ttgtggttga 660 

caacgccgga aaacttctct ctgccccaaa taaggaagcg gccgctcgag gtgacttgct 720 

cgacacacta accacttcca gcgaccaatc tcctgagctc gctgaagctc gccaagcagt 780 

cctagccatc agggaaaagc cggacccccc gatagcctca cctcgcaaga agcccgctat 840 
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catgcttgga 
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2100 



tcgacccggg 2110 

<210> 16 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S51 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 16 
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<210> 17 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S525 for PCR using cDNA of Arabidopsis 
thaliana ecocype Columbia 

<400> 17 

aggttccgat tatgtgtgac gcttcaccta 30 



<210> 18 

<211> 3522 

<212> DNA 

<2I3> Arabidopsis chaliana ecotype Columbia 
<220> 

<221> CDS 

<222> (100) .... (3342J 

<223> ACMSH3 full -length cDNA and deduced sequence of the encoded 
polypeptide 

<400> 18 

cctaagaaag cgcgcgaaaa ttggcaaccc aagttcgcca tagccacgac cacgaccttc 6 0 

catttctc-t aaacggagga gattacgaat aaagcaatt 99 

atg ggc aag caa aag cag cag acg att tct cgt ttc ttc get ccc aaa 147 
Met Gly Lys Gin Lys Gin Gin Thr lie Ser Arg Phe Phe Ala Pro Lys 
1 5- 10 15 

ccc aaa tec ccg act cac gaa ccg aat ccg gta gec gaa tea tea aca 
Pro Lys Ser Pro Thr His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr 
20 25 30 

ccg cca ccg aag ata tec gee act gta tec ttc tct cct tec aag cgt 243 
Pro Pro Pro Lys lie Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg 
35 40 45 



195 



aag ctt etc tec gac cac etc gee gec gcg tea ccc aaa aag cct aaa 
Lys Leu Leu Ser Asp His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys 
50 55 60 

ctt tct cct cac act caa aac cca gta ccc gat ccc aat tta cac caa 
Leu Ser Pro His Thr Gin Asn Pro Val Pro Asp Pro Asn Leu His Gin 
65 70 75 80 



291 



339 
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aga ctt etc cag aga ttt ctg gaa ccc ccg ccg gag gaa tat gtt ccc 387 
Arg Phe Leu Gin Arg Phe Leu GIu Pro Ser Pro Glu Glu Tyr Val Pro 

85 90 95 

gaa acg tea cca ccg agg aaa cac aca cca ctg gaa cag caa gtg gtg 43 5 

Glu Thr Ser Ser Ser Arg Lys Tyr Thr Pro Leu Glu Gin Gin Val Val 

100 105 110 

gag eta aag age aag tac cca gat gtg gtt ttg atg gtg gaa gtt ggt 483 

Glu Leu Lys Ser Lys Tyr Pro Asp Val Val Leu Met Val Glu Val Gly 
115 120 125 

tac agg tac aga ttc ttc gga gaa gac gcg gag ate gca gca cgc gtg 531 

Tyr Arg Tyr Arg Phe Phe Gly Glu Asp Ala Glu lie Ala Ala Arg Val 
130 135 140 

ttg ggt att tac get cat atg gat cac aat ttc atg acg gcg agt gtg 579 

Leu Gly He Tyr Ala His Met Asp His Asn Phe Met Thr Ala Ser Val 
145 ISO 155 160 

cca aca ttt cga ttg aat ttc cat gtg aga aga ctg gtg aat gca gga 627 

Pro Thr Phe Arg Leu Asn Phe His Val Arg Arg Leu Val Asn Ala Gly 
165 170 175 

tac aag att ggt gta gtg aag cag act gaa act gca gec att aag tec 675 

Tyr Lys He Gly Val Val Lys Gin Thr Glu Thr Ala Ala He Lys Ser 

180 185 190 

cat ggt gca aac egg acc ggc cct ttt ttc egg gga ctg teg gcg ttg 723 

His Gly Ala Asn Arg Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu 
195 200 205 

tat acc aaa gec acg ctt gaa gcg get gag gat ata agt ggt ggt tgt 771 

Tyr Thr Lys Ala Thr Leu Glu Ala Ala Glu Asp lie Ser Gly Gly Cys 
210 215 220 

ggt .ggt gaa gaa ggt ttt ggt tea cag agt aat ttc ttg gtt tgt gtt 819 

Gly Gly Glu Glu Gly Phe Gly Ser Gin Ser Asn Phe Leu Val Cys Val 
225 230 235 240 

gtg gat gag aga gtt aag teg gag aca tta ggc tgt ggt att gaa atg 86 7 

Val Asp Glu Arg Val Lys Ser Glu Thr Leu Gly Cys Gly He Glu Met 
245 250 255 

agt ttt gat gtt aga gtc ggt gtt gtt ggc gtt gaa att teg aca ggt 915 

Ser Phe Asp Val Arg Val Gly Val Val Gly Val Glu He Ser Thr Gly 

260 265 270 

gaa gtt gtt tat gaa gag ttc aat gat aat ttc atg aga agt gga tta 963 

Glu Val Val Tyr Glu Glu Phe Asn Asp Asn Phe Met Arg Ser Gly Leu 
275 280 285 
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gag gcc gtg ate ttg age teg tea cca get gag ctg ctg ctt ggc cag 1011 

GLu Ala Val lie Leu Ser Leu Ser Pro Ala Glu Leu Leu Leu Gly Gin 

250 295 300 

cc: cc: tea caa caa act gag aag ttt ttg gtg gca cat get gga cct 1059 

Pro Leu Ser Gin Gin Thr Glu Lys Phe Leu Val Ala Met Ala Gly Pro 

305 310 315 320 

acc tea aac gtt cga gtg gaa cgt gcc tea ctg gat tgt ttc age aat 1107 

Thr Ser Asn Val Arg Val Glu Arg Ala Ser Leu Asp Cys Phe Ser Asn 

325 330 335 

ggt aat gca gta gat gag gtt att tea tta tgt gaa aaa ate age gca 1155 

Gly Asn Ala Val Asp Glu Val lie Ser Leu Cys Glu Lys lie Ser Ala 

340 345 350 



ggt aac tta gaa gat gat aaa gaa atg aag ctg gag get get gaa aaa 
Gly Asn Leu Glu Asp Asp Lys Glu Met Lys Leu Glu Ala Ala Glu Lys 

355 360 365 



1203 



1395 



gga atg tct tgc ttg aca gtt cat aca att atg aac atg cca cat ctg 1251 
Gly Met Ser Cys Leu Thr Val His Thr lie Met Asn Met Pro His Leu 
370 375 360 

act gtt caa gcc etc gcc eta aeg ttt tgc cat etc aaa cag ttt gga 1299 
Thr Val Gin Ala Leu Ala Leu Thr Phe Cys His Leu Lys Gin Phe Gly 
385 390 395 400 

ttt gaa agg ate ctt tac caa ggg gcc tea ttt cge tct ttg tea agt 1347 
Phe Glu Arg lie Leu Tyr Gin Gly Ala Ser Phe Arg Ser Leu Ser Ser 
405 410 415 

aac aca gag atg act etc tea gcc aat act ctg caa cag ttg gag gtt 
Asn Thr Glu Met Thr Leu Ser Ala Asn Thr Leu Gin Gin Leu Glu Val 
420 425 430 

gtg aaa aat aat tea* gat gga teg gaa tct ggc tec tta ttc cat aat 144 3 

Val Lys Asn Asn Ser Asp Gly Ser Glu Ser Gly Ser -Leu Phe His Asn 
435 440 445 

atg aat cac aca ctt aca gta tat get tec agg ctt ctt aga cac tgg 1491 
Met Asn His Thr Leu Thr Val Tyr Gly Ser Arg Leu Leu Arg His Trp 
450 455 460 

gtg act cat cct eta tgc gat aga aat ttg ata tct get egg ctt gat 153 9 

Val Thr His Pro Leu Cys Asp Arg Asn Leu lie Ser Ala Arg Leu Asp 
465 470 475 480 

get gtt tct gag att tct get tgc atg gga tct cat agt tct tec cag 1587 
Ala Val Ser Glu lie Ser Ala Cys Met Gly Ser His Ser Ser Ser Gin 
485 490 495 
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ccc age age 
Leu Ser Ser 



ccc gag ctt 
Pro Glu Phe 
515 

cca ccc gac 
Ser Ser Asp 
530 

aaa gec aca 
Lys Ala Thr 
545 

aag caa acc 
Lys Gin He 



acg caa ccc 
Met Gin Ser 



get acc cca 
Val He Ser 
595 

gec cca aac 
Ala Leu Asn 
610 

acc acc ccc 
He Thr Ser 
625 

gtc cca gec 
Val Leu Val 



aag aag etc 
Lys Lys Leu 



aca cac teg 
Thr His Leu 
675 

gcg aaa gta 
Val Lys Val 
690 



gag ccg gec 
Glu Leu Val 
500 

cac ccc gcg 
Tyr Leu Val 



act caa cgc 
He Gin Arg 



gag ccc acc 
Glu Phe He 
550 

cag egg ccc 
Gin Arg Leu 
565 

gca acc gcg 
Ala Thr Val 
580 

ccc ccc gec 
Ser Pro Val 



aag gaa gcg 
Lys Glu Ala 



age gac caa 
Ser Asp Gin 
630 

ate agg gaa 
He Arg Glu 
645 

get att cga 
Ala He Arg 
660 

ata gag ctg 
He Glu Leu 



aat age acc 
Asn Ser Thr 



gaa gaa ggc 
Glu Glu Gly 
505 

ccc ccc cca 
Leu Ser Ser 
520 

gga aca aca 
Gly He Thr 
535 

gca get atg 
Ala Val Met 



ggc ata aag 
Gly He Lys 



cga cce acc 
Arg Ser Thr 
585 

gcg gee gac 
Val Val Asp 
600 

gec gec cga 
Ala Val Arg 
615 

ccc ccc gag 
■Phe Pro Glu 



aag ceg gat 
Lys Leu Asp 



aat ttg gaa 
Asn Leu Glu 
665 

ccc gtt gat 
Pro Val Asp 
680 

aag aag act 
Lys Lys Thr 
695 
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tct gag aga 
Ser Glu Arg 



gee ccg aca 
Val Leu Thr 



aga acc ctt 
Arg He Phe 
540 

gaa get att 
Glu Ala He 
555 

caa gac tct 
Gin Asp Ser 
570 

etc ctg aga 
Leu Leu Arg 



aac gec gga 
Asn Ala Giy 



ggc gac ccg 
Gly Asp Leu 
620 

ctt get gaa 
Leu Ala Glu 
635 

tec teg ata 
Ser Ser He 
650 

ttt ctt caa 
Phe Leu Gin 



tec aag gtc 
Ser Lys Val 

att cga tat 
He Arg Tyr 
700 



gca ate gca 
Ala He Val 
510 

gee acg ccc 
Ala Mec Ser 
525 

cat egg act 
His Arg Thr 



tta ctt gcg 
Leu Leu Ala 



gaa atg agg 
Glu Met Arg 
575 

aaa ttg ate 
Lys Leu He 
590 

aaa ctt etc 
Lys Leu Leu 
605 

ccc gac aca 
Leu Asp He 



gee cgc caa 
Ala Arg Gin 



get tea ttt 
Ala Ser Phe 
655 

gtg teg ggg 
Val Ser Gly 
670 

cct atg aat 
Pro His Asn 
685 

cat ccc cca 
His Pro Pro 



cca 1635 
Ser 



aga 1683 
Arg 



gee 1731 
Ala 



ggg 1779 

Gly 

560 

agt 1827 
Ser 



tct 1875 
Ser 



tct 1923 
Ser 



cca 1971 
Leu 



gca 2019 

Ala 

640 

cgc 206 7 

Arg 



ate 2115 
He 



tgg 216 3 

Trp 



gaa 2211 
Glu 
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aca gca gec ggc ccg gat gag cca gec cca gca ac: gaa cat etc gee 2259 

lie Val Ala Gly Leu Asp Glu Leu Ala Leu Ala Thr Glu His Leu Ala 
705 710 715 720 

acc gtg aac cga get ccg tgg gat age ccc ccc aag age ccc age aga 2307 
lie Val Asn Arg Ala Ser Trp Asp Ser Phe Leu Lys Ser Phe Ser Arg 
725 730 735 

tac cac aca gat ccc aag gec gec get caa gec ccc get gca ceg gac 23 55 

Tyr Tyr Thr Asp Phe Lys Ala Ala Val Gin Ala Leu Ala Ala Leu Asp 
740 745 750 

cgt ttg cac ccc ccc Cca acc cca cce aga aac aag aac tat gtc cgt 2403 
Cys Leu His Ser Leu Ser Thr Leu Ser Arg Asn Lys Asn Tyr Val Arg 
755 760 765 

ccc gag ccc gtg gat gac cgc gaa cca gec gag aca aac ata cag tec 2451 
Pro Glu Phe Val Asp Asp Cys Glu Pro Val Glu lie Asn lie Gin Ser 
770 775 780 

ggc cgc cac ccc gca ccg gag acc aca cca caa gac aac ccc gec cca 2499 
Gly Arg His Pro Val Leu Giu Thr lie Leu Gin Asp Asn Phe Val Pro 
785 790 795 300 

aac gac aca acc ccg cac gca gaa ggg gaa cat cgc caa ate aec acc 2547 
Asn Asp Thr lie Leu His Ala Glu Gly Glu Tyr Cys Gin lie lie Thr 
805 810 815 

gga ccc aac acg gga gga aag age cgc cac acc cgc caa gee gee cca 2595 
Gly Pro Asn Met Gly Gly Lys Ser Cys Tyr lie Arg Gin Val Ala Leu 
820 825 830 

acc ccc aca acg gee cag gec ggc ccc ccc gca cca gcg cca eec gec 2643 
lie Ser lie Met Ala Gin Val Gly Ser Phe Val Pro Ala Ser Phe Ala 
835 840 845 

aag ctg cac gtg etc gat ggt gee eec acc egg acg gge gee tea gac 2691 
Lys Leu His Val Leu Asp Gly Val Phe Thr Arg Mec Gly Ala Ser Asp 
850 855 860 ■ 

age aec cag cac ggc aga age acc cee cca gaa gaa eea age gaa gcg 273 9 

Ser lie Gin His Gly Arg Ser Thr Phe Leu Glu Glu Leu Ser Glu Ala 
865 870 875 880 

tea cac aca aec aga acc cgc ccc ect cgc ccg cec gee aea cca gac 2787 
Ser His lie lie Arg Thr Cys Ser Ser Arg Ser Leu Val lie Leu Asp 
885 890 895 

gag ctt gga aga ggc act age aca cac gac ggc gea gec att gec eat 283 5 

Glu Leu Gly Arg Gly Thr Ser Thr His Asp Gly Val Ala lie Ala Tyr 
900 905 910 



INSDOCID: cWO 9919492A2 » > 



WO 99/1 9492 PCT/EP98/06977 

13 

gca aca tea cag cat etc cca gca gaa aag aga cgc ctg get etc ttt 2883 
Ala Thr Leu Gin His Leu Leu Ala Glu Lys Arg Cys Leu Val Leu Phe 
915 920 925 

gtc acg cat tac cct gaa ata get gag ate agt aac gga ttc cca ggt 2931 
Val Thr His Tyr Pro Glu lie Ala Glu He Ser Asn Gly Phe Pro Gly 
930 935 940 

tct gtt ggg aca tac cat gtc teg tat ctg aca ttg cag aag gat aaa 2979 
Ser Val Gly Thr Tyr His Val Ser Tyr Leu Thr Leu Gin Lys Asp Lys 
945 950 955 960 

ggc agt tat . gat cat gat gat gtg acc tac eta tat aag ctt gtg cgt 3027 
Gly Ser Tyr Asp His Asp Asp Val Thr Tyr Leu Tyr Lys Leu Val Arg 
965 970 975 

ggt ctt tgc age agg age ttt ggt ttt aag gtt get cag ctt gec cag 3075 
Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
980 985 990 

ata cct cca tea tgt ata cgt cga gec att tea atg get gca aaa ttg 3123 
He Pro Pro Ser Cys He Arg Arg Ala lie Ser Met Ala Ala Lys Leu 
995 1000 1005 

gaa get gag gta cgt gca aga gag aga aa: aca cgc atg gga gaa cca 3171 
Glu Ala Glu Val- Arg Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro 
1010 1015 1020 

gaa gga cat gaa gaa ccg aga ggc gca gaa gaa tct att teg get eta ,3219 
Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu Ser He Ser Ala Leu 
1025 1030 1035 1040 

ggt gac ttg ttt gca gac ctg aaa ttt get etc tct gaa gag gac cct 3267 
Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro 
1045 1050 1055 

tgg aaa gca ttc gag ttt tta aag cat get tgg aag att get ggc aaa 3315 
Trp Lys Ala Phe Glu Phe Leu Lys His Ala Trp Lys He Ala Gly Lys 
1060 1065 1070 

ate aga eta aaa cca act tgt tea ttt tgatttaatc ttaacattat 3362 
He Arg Leu Lys Pro Thr Cys Ser Phe 
1075 1080 

ageaactgea aggtcttgat catctgttag ttgegtacta acttatgtgt attagtataa 3422 

caagaaaaga gaattagaga gatggattct aatccggtgt tgcagtacat cttttctcca 3482 

cccgcacaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3 522 



<210> 19 
<211> 1081 
<212> PRT 
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<213> Arabidopsls chaliana ecotype Columbia 

<223> Polypeptide MSH3 

<400> 19 

Met Gly Lys Gin Lys Gin Gin Thr lie Ser Arg Phe Phe Ala Pro Lys 
1 5 10 15 

Pro Lys Ser Pro Thr His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr 
20 25 30 

Pro Pro Pro Lys lie Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg 
35 40 45 

Lys Leu Leu Ser Asp His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys 
. 50 55 60 

Leu Ser Pro His Thr Gin Asn Pro Val Pro Asp Pro Asn Leu His Gin 
65 70 75 80 

Arg Phe Leu Gin Arg Phe Leu Glu Pro Ser Pro Glu Glu Tyr Val Pro 

85 90 95 

Glu Thr Ser Ser Ser Arg Lys Tyr Thr Pro Leu Glu Gin Gin Val Val 
100 105 110 

Glu Leu Lys Ser Lys Tyr Pro Asp Val Val Leu Met Val Glu Val Gly 
115 120 125 

Tyr Arg Tyr Arg Phe Phe Gly Glu Asp Ala Glu He Ala Ala Arg Val 
130 135 140 

Leu Gly lie Tyr Ala His Met Asp His Asn Phe Met Thr Ala Ser Val 
145 150 155 160 

Pro Thr Phe Arg Leu Asn Phe His Val Arg Arg Leu Val Asn Ala Gly 
165' 170 175 

Tyr Lys He Gly Val Val Lys Gin Thr Glu Thr Ala Ala He Lys Ser 
180 185 190 

His Gly Ala Asn Arg Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu 
195 200 205 

Tyr Thr Lys Ala Thr Leu Glu Ala Ala Glu Asp He Ser Gly Gly Cys 
210 215 220 

Gly Gly Glu Glu Gly Phe Gly Ser Gin Ser Asn Phe Leu Val Cys Val 
225 230 235 240 

Val Asp Glu Arg Val Lys Ser Glu Thr Leu Gly Cys Gly He Glu Met 
245 250 255 
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Ser Phe Asp Val Arg Val Gly Val Val Gly Val Glu lie Ser Thr Gly 
260 265 270 

GLu Val Val Tyr Glu Glu Phe Asn Asp Asn Phe Met Arg Ser Gly Leu 
275 280 285 



Glu Ala Val lie Leu Ser Leu Ser Pro Ala Glu Leu Leu Leu Gly Gin 

290 295 300 

Pro Leu Ser Gin Gin Thr Glu Lys Phe Leu Val Ala Met Ala Gly Pro 

305 310 315 320 



Thr Ser Asn Val Arg Val Glu Arg Ala Ser Leu Asp Cys Phe Ser Asn 
325 330 335 

Gly Asn Ala Val Asp Glu Val lie Ser Leu Cys Glu Lys lie Ser Ala 
340 345 350 



Gly Asn Leu Glu 
355 

Gly Met Ser Cys 
370 

Thr Val Gin Ala 
385 

Phe Glu Arg lie 



Asn Thr Glu Met 
420 

Val Lys Asn Asn 
435 

Met Asn His Thr 
450 

Val Thr His Pro 
465 

Ala Val Ser Glu 



Leu Ser Ser Glu 
500 



Asp Asp Lys Glu 
360 

Leu Thr Val Kis 
375 

Leu Ala Leu Thr 
390 

Leu Tyr Gin Gly 
405 

Thr Leu Ser Ala 



Ser Asp Gly Ser 
440 

Leu Thr Val Tyr 
455 

Leu Cys Asp Arg 
470 

lie Ser Ala Cys 
485 

Leu Val Glu Glu 



Met Lys Leu Glu 



Thr He Met Asn 
380 

Phe Cys His Leu 
395 

Ala Ser Phe Arg 
410 

Asn Thr Leu Gin 
425 

Glu Ser Gly Ser 



Gly Ser Arg Leu 
460 

Asn Leu He Ser 
475 

Met Gly Ser His 
490 

Gly Ser Glu Arg 
505 



Ala Ala Glu Lys 
365 

Met Pro His Leu 



Lys Gin Phe Gly 
400 

Ser Leu Ser Ser 
415 

Gin Leu Glu Val 
430 

Leu Phe His Asn 
445 

Leu Arg His Trp 



Ala Arg Leu Asp 
480 

Ser Ser Ser Gin 
495 

Ala He Val Ser 
510 



Pro Glu Phe Tyr 
515 

Ser Ser Asp He 
530 



Leu Val Leu Ser 
520 

Gin Arg Gly He 
535 



Ser Val Leu Thr 

Thr Arg He Phe 
540 



Ala Met Ser Arg 
525 

His Arg Thr Ala 
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Lys Ala Thr Glu Phe lie Ala Val Met Glu Ala lie Leu Leu Ala Gly 
545 . 550 555 560 

Lys Gin lie Gin Arg Leu Gly lie Lys Gin Asp Ser Glu Met Arg Ser 
565 570 575 

Met Gin Ser Ala Thr Val Arg Ser Thr Leu Leu Arg Lys Leu lie Ser 
580 585 590 

Val lie Ser Ser Pro Val Val Val Asp Asn Ala Gly Lys Leu Leu Ser 
595 600 605 

Ala Leu Asn Lys Glu Ala Ala Val Arg Gly Asp Leu Leu Asp lie Leu 
610 615 620 

lie Thr Ser Ser Asp Gin Phe Pro Glu Leu Ala Glu Ala Arg Gin Ala 
625 630 635 640 

Val Leu Val lie Arg Glu Lys Leu Asp Ser Ser lie Ala Ser Phe Arg 
645 650 655 

Lys Lys Leu Ala lie Arg Asn Leu Glu Phe Leu Gin Val Ser Gly lie 
660 665 670 

Thr His Leu Tie Glu Leu Pro Val Asp Ser Lys Val Pro His Asn Trp 
675 680 685 

Val Lys Val Asn Ser Thr Lys Lys Thr lie Arg Tyr His Pro Pro Glu 
690 695 700 

lie Val Ala Gly Leu Asp Glu Leu Ala Leu Ala Thr Glu His Leu Ala 
705 710 715 720 

lie Val Asn Arg Ala Ser Trp Asp Ser Phe Leu Lys Ser Phe Ser Arg 
725 730 735 

Tyr Tyr Thr Asp Phe~ Lys Ala Ala Val Gin Ala Leu Ala Ala Leu Asp 
740 745 750 

Cys Leu His Ser Leu Ser Thr Leu Ser Arg Asn Lys Asn Tyr Val Arg 
755 760 765 

Pro Glu Phe Val Asp Asp Cys Glu Pro Val Glu He Asn He Gin Ser 
770 775 780 

Gly Arg His Pro Val Leu Glu Thr He Leu Gin Asp Asn Phe Val Pro 
785 790 795 800 

Asn Asp Thr He Leu His Ala Glu Gly Glu Tyr Cys Gin He He Thr 
805 810 815 

Gly Pro Asn Met Gly Gly Lys Ser Cys Tyr He Arg Gin Val Ala Leu 
820 825 830 
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lie Ser lie Met Ala Gin Val Gly Ser Phe Val Pro Ala Ser Phe Ala 
835 840 845 

Lys Leu His Val Leu Asp Gly Val Phe Thr Arg Met Gly Ala Ser Asp 
850 855 860 

Ser lie Gin His Gly Arg Ser Thr Phe Leu Glu Glu Leu Ser Glu Ala 
865 870 875 880 

Ser His lie lie Arg Thr Cys Ser Ser Arg Ser Leu Val lie Leu Asp 
885 890 895 

Glu Leu Gly Arg Gly Thr Ser Thr His Asp Gly Val Ala lie Ala Tyr 
900 905 910 

Ala Thr Leu Gin His Leu Leu Ala Glu Lys Arg Cys Leu Val Leu Phe 
915 920 925 

Val Thr His Tyr Pro Glu lie Ala Glu lie Ser Asn Gly Phe Pro Gly 
930 935 940 

Ser Val Gly Thr Tyr His Val Ser Tyr Leu Thr Leu Gin Lys Asp Lys 
945 950 955 960 

Gly Ser Tyr Asp His Asp Asp Val Thr Tyr Leu Tyr Lys Leu Val Arg 

965 970 975 

Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
980 985 990 

lie Pro Pro Ser Cys lie Arg Arg Ala lie Ser Men Ala Ala Lys Leu 
995 1000 1005 

Glu Ala Glu Val Arg Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro 
1010 1015 1020 

Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu Ser lie Ser Ala Leu 
1025 1030 1035 1040 

Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro 
1045 1050 1055 

Trp Lys Ala Phe Glu Phe Leu Lys His Ala Trp Lys lie Ala Gly Lys 
1060 1065 1070 

lie Arg Leu Lys Pro Thr Cys Ser Phe 
1075 1080 



<210> 20 

<211> 24 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> MSH6 specific primer 638 for PCR using cDNA of Arabidopsis 

thaliana ecotype Columbia 

<400> 20 

cctccaccag gcgacgaaaa accg 



<210> 21 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer S81 for PCR using cDNA of Arabidopsis thaliana ecotype 
Columbia 

<400> 21 

cgtcgccrtt agcatcccct tccttcac 28 

<210> 22 

<211> 30 . 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer S823 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 22 

gcttggcgca tctaatagaa tcatgacagg 3 0 

<210> 23 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer 637 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 23 

gacagcgtca gttcttcaga atgc 24 



<210> 24 
<211> 33 
<212> DNA 
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<213> Artificial sequence 

- 2 2 C -> 

;223> MSH6 specific primer 1S3 for PCR using cDNA of Arabidopsis 

thaliana ecocype Columbia 

<400> 24 

atcccgggat gcagcgccag agatcgattt tgt 33 

<210> 25 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer S83 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<40C> 25 



cgc:a:cca: 


ggctgcc teg 


aaccgag 








27 


<21C> 
<211> 
<212> 
<213> 
<223> 


26 

1385 
DNA 

Arabidopsis thaliana ecotype Columbia 
Clone 4 3 






<40C> 


26 












cccgggatgc 


agegecagag 


atcgatt ttg 


tctttcttcc 


aaaaacccac 


ggeggegact 


60 


acgaagggtt 


tggtttccgg 


egatgetget 


ageggegggg 


geggcagegg 


aggaccacga 


120 


tttaatgtga 


aggaagggga 


tgecaaagge 


gaegcttctg 


tacgttttgc 


tgtttcgaaa 


180 


tctgtcgatg 


aggttagagg 


aacggatact 


ccaccggaga 


aggttccgcg 


tcgtgtcctg 


240 


ccgtctggat 


ttaagcegge 


tgaatccgcc 


ggtgatgctt 


cgtccctgtt 


ctccaatatt 


300 


atgcataagt 


ttgtaaaagt 


cgatgaccga 


gattgttctg 


gagagaggag 


ccgagaagat 


360 


gttgttccgc 


tgaatgattc 


atctctatgt 


atgaaggcta 


atgatgttat 


tccccaattt 


420 


cgttccaata 


atggtaaaac 


tcaagaaaga 


aaccatgett 


ttagtttcag 


tgggagagct 


480 


gaacctagat 


cagtagaaga 


tataggagta 


gatggcgatg 


ttcctggtcc 


agaaacacca 


540 


gggatgcgtc 


cacgtgcttc 


tegcttgaag 


cgagttctgg 


aggatgaaat 


gacntttaag 


600 


gaggataagg 


ttcctgtatt 


ggactctaac 


aaaaggctga 


aaatgetcca 


ggatccggtt 


660 
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cgcggagaga 


agaaagaagt 


aaacgaagga 


accaaacccg 


aacggcccga 


gtccccccga 


720 


accagggacg 


ccaatagaag 


acgtcccgat 


gacccccctt 


acgatagaaa 


gaccttacac 


730 


ataccacccg 


atgtctccaa 


gaaaatgtcc 


gcancacaaa 


agcaacactg 


gagcgtcaag 


840 


agtgaatata 


tggacancgt 


gcttttctct 


aaagcgggga 


aatuttatga 


gctgcatgag 


900 


ctagatgcgg 


aattaggtca 


caaggagcct 


gactggaaga 


tgaccatgag 


tggtgtggga 


960 


aaatgcagac 


aggtcggtac 


ctccgaaagc 


gggacagatg 


aggcagcgca 


aaagctatta 


1020 


gctcgtggat 


acaaagttgg 


acgaaccgag 


cagctagaaa 


catctgacca 


agcaaaagcc 


1080 


agaggtgcta 


acactataat 


tccaaggaag 


ctagcccagg 


catcaactcc 


atcaacagca 


1140 


agcgagggaa 


acatcgggcc 


cgatgccgtc 


cacccrcctg 


ccacaaaaga 


gatcaaaacg 


1200 


gagctacaaa 


agcgttcaac 


cgcgca tgga 


crtgczcttg 


ttgactgtgc 


tgccttgagg 


1250 


ccccggg" g 


ggtccaccag 


cgacga:gca 


ccatgtgccg 


ctcccggagc 


gtcactgacg 


1320 


caggzzzczc 


caaaggaagc 


gc:aca:gac 


agcaaagggc 


catcaagaga 


agcacaaaag 


1330 


gctctaagga 


aatacacgtc 


gacagggcc. 


acggcggcac 


agttggcccc 


agtaccacaa 


1440 


gcaatggggg 


acacagatgc 


tgccggagct 


agaaatataa 


tagaatccaa 


cggatacttt 


1500 


aaaggccccc 


ccgaatcatg 


gaactgcgcc 


gttgatggcc 


taaacgaatg 


tgatgccgcc 


1550 


cccagcgctc 


tcggagagct 


aaccaaccac 


ccgnccaggc 


taaagctaga 


agatgtactt 


1620 


aagcacgggg 


acacctctcc 


ataccaagct 


cacaggggtt 


gtcucagaat 


cgatggccag 


1680 


acgacggtaa 


atcttgagat 


atttaacaac 


agccgtgatg 


gtggcccttc 


agggaccctg 


1740 


cacaaacacc 


tcgataaccg 


tgttagtcc;a 


accggcaagc 


gactcttaag 


gaattggacc 


1800 


cgccacccac 


tcaaagatgt 


agaaagcatc 


aacaaacggc 


ccgacgtagc 


tgaagaaccc 


1860 


acggcaaacc 


cagaaagtat 


gcaaaccacc 


ggccagcacc 


tccacaaact 


cccagaccta 


1920 


gaaagactgc 


tcggacgcat 


caagtccagc 


gctcgatcat 


cagcctctgc 


gttgcctgct 


1980 


cttctgggga 


aaaaagtgct 


gaaacaacga 


gttaaagcat 


ttgggcaaat 


tgcgaaaggg 


2040 


tccagaagcg 


gaattgatct 


gctgccggcc 


ctacagaagg 


aaccaaatat 


gacgagctcg 


2100 


ctttataaac 


tctgtaaact 


tcctacacca 


gtaggaaaaa 


gcgggctaga 


gctacccctc 


2160 


tctcaattcg 


aagcagccat 


agacagcg 








2188 
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<210> 
<211> 
<212> 
<213> 
<223> 

<400> 



21 



27 

1385 
DNA 

Arabidopsis chaliana ecotype Columbia 
Clone 62 



27 



catcagcctc tgcgccgcct gctcttctgg ggaaaaaagt gccgaaacaa cgagttaaag 60 

catctgggca aattgngaaa gggctcagaa gtggaattga tcngttgttg gctctacaga 120 

aggaaccaaa tatgacgagt ttgctctata aactctgtaa acttcctata tcagtaggaa 180 
aaagcgggct agagccattt ctttctcaat ccgaagcagc catagatagc gactttccaa . 240 

attatcagaa ccaagatgtg acagatgaaa acgctgaaac tcccacaata cttatcgaac 300 

cnttcatcga aagagcaact caacggtccg aggtcactca caccataagc cgcccagatg 360 

tcccgagatc ttccgcaacc gcagcaagcc cctccgctgg aagcacggcc aggcctgtca 420 

ctttccccga atcagaagct acagarcaga accagaaaac aaaagggcca acacccaaaa 480 

cccaaggacc acggcaccca cccgcagccg cagccgacgg tcaattgcc: gccccgaacg 540 

atatactccc tggcgaggct agaagaagca gcggcagcat tcaccctcgg ccactgttac 600 

tgacgggacc aaacatgggc ggaaaaccaa ctcttcctcg tgcaacatgt ctggccgtca 660 

cctctgccca acctggccgc tacgtgccgt gcgagcctcg cgaaatcccc ctcgtggata 720 

ccacccccac aaggcttggc gcatctgata gaaccacgac aggagagagt accttttcgg 780 

cagaacgcac cgagacagcg ccagttcttc agaacgcaac ccaggactca ctagtaatcc 840 

tcgacgaact gggcagagga actagcactc tcgatggata cgccatcgca cactcggttt 9O0 

ttcgccacct ggtagagaaa gttcaatgtc ggatgctctt tgcaacacat taccaccctc 960 

tcaccaagga attcgcgccc cacccacgtg tcacctcgaa acacatggct tgcgcattca 1020 

aatcaagatc tgattatcaa ccacgcggtt gcgatcaaga cctagtgttc ttgtaccgtt 1080 

taaccgaggg agcttgtcct gagagctacg gacttcaagt ggcactcatg gctggaatac 114 0 

caaaccaagt ggttgaaaca gcatcaggtg ctgctcaagc catgaagaga tcaattgggg 120 0 

aaaacttcaa gtcaagtgag ctaagatctg agttctcaag tctgcatgaa gaccggccca 126 0 

agtcattggt gggtacttct cgagtcgccc acaacaatgc ccccactggc gaagatgact 132 0 

acgacacctt gctctgccca cggcacgaga tcaaatcctc tnaccgtgct cccaaacaac 138 0 
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1385 



<210> 28 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 



<223> 


MSH6 specific primer 2S8 for 


PCR using cDNA of Arabidopsis 




thaliana ecotype Columbia 




<400> 


26 






ar r r cma a ?i c acacrt aaaaci oatt 

C* www y y as w u, w» ** <-* *-* 


34 


<21D> 






< 21 1 > 






<212> 


DNA 




<213> 


Artificial sequence 




< Z 0 > 






<223 > 


VI C* T_I C ts a "i "4 '/■» V i mo v- C Q ^ f y 

/lano spscitlC piiiriei ooz Lui 


D<~P nc-i nrr rDMA o f ArabidoDSlS 




f-V^^T t ana ornr v^P Pnl i irph i a 
L, ua J. X alia cuuLy^c uuiuniuxa 




<400> 


29 




gcgtccgatc 


accagcccct gtgttgc 


27 


<210> 


30 




<211> 


3606 




<212> 


DNA 




<213> 


AraJbidopsis thaliana ecotype 


Columbia 


<220> 






<221> 


CDS 




<222> 


(142) (3468) 




<223> 


AtMSH6 full-length cDNA and 


deduced sequence of the encoded 




polypeptide 




<400> 


30 





aaaagttgag ccctgaggag tatcgtttcc gccatttcta cgacgcaagg cgaaaatttt 6 0 

tggcgccaat ctttcccccc tttcgaattc tctcagctca aaacatcgtt tctctctcac 120 

tctctctcac aattccaaaa a atg cag cgc cag aga teg att ttg tct ttc 171 

Met Gin Arg Gin Arg Ser lie Leu Ser Phe 
1 5 10 
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tec caa aaa ccc acc gcg gcg act acg aag ggt ttg gtt tec ggc gat 219 

Phe Gin Lys Pro Thr Ala Ala Thr Thr Lys Gly Leu Val Ser Gly Asp 

15 20 25 

get get age ggc ggg ggc ggc age gga gga cca cga ttt aat gtg aag 267 
Ala Ala Ser Gly Gly Gly Gly Ser Gly Gly Pro Arg Phe Asn Val Arg 
30 35 40 

gaa ggg gat get aaa ggc gac get tct gta cgt ttt get gtt teg aaa 315 
Glu Gly Asp Ala Lys Gly Asp Ala Ser Val Arg Phe Ala Val Ser Lys 
45 50 55 

tct gtc gat gag gtt aga gga acg gat act cca ccg gag aag gtt ccg 363 
Ser Val Asp Glu Val Arg Gly Thr Asp Thr Pro Pro Glu Lys Val Pro 
60 65 70 

cgt cgt gtc- ctg ccg tct gga ttt aag ccg get gaa tec gee gst gat 411 
Arg Arg Val Leu Pro Ser Gly Phe Lys Pro Ala Glu Ser Ala Gly Asp 
75 80 85 90 

get teg tec ctg ttc tec aat art atg cat aag ttt gta aaa gtc gat 459 
Ala- Ser Ser Leu Phe Ser Asn lie Met His Lys Phe Val Lys Val Asp 

95 100 105 

gat cga gat tgt tct gga gag age age cga gaa gat gtt gtt ccg ctg 507 
Asp Arg Asp Cys Ser Gly Glu Arg Ser Arg Glu Asp Val Val Pro Leu 
110 115 120 

aat gat tea tct eta tgt atg aag get aat gat gtt att cct caa ttt 555 
Asn Asp Ser Ser Leu Cys Met Lys Ala Asn Asp Val lie Pro Gin Phe 
125 130 135 

cgt tec aat aat ggt aaa act caa gaa aga aac cat get ttt agt ttc 603 
Arg Ser Asn Asn Gly Lys Thr Gin Glu Arg Asn His Ala Phe Ser Phe 
140 145 150 

agt ggg aga get gaa ctt aga tea gta gaa gat ata gga gta gat ggc 651 
Ser Gly Arg Ala Glu Leu Arg Ser Val Glu Asp lie Gly Val Asp Gly 
155 160 165 170 

gat gtt cct ggt cca gaa aca cca ggg atg cgt cca cgt get tct cgc 699 
Asp Val Pro Gly Pro Glu Thr Pro Gly Met Arg Pro Arg Ala Ser Arg 

175 180 185 

ttg aag cga gtt ctg gag gat gaa atg act ttt aag gag gat aag gtt 747 
Leu Lys Arg Val Leu Glu Asp Glu Met Thr Phe Lys Glu Asp Lys Val 
190 195 200 

cct gta ttg gac tct aac aaa agg ctg aaa atg etc cag gat ccg gtt 795 
Pro Val Leu Asp Ser Asn Lys Arg Leu Lys Met Leu Gin Asp Pro Val 
205 210 215 



SDOCIO <WO 9919492A2J_> 



WO 99/19492 PCT/EP98/06977 

24 

cgt gga gag aag aaa gaa gta aac gaa gga acc aaa tec gaa egg etc 84 3 

Cys Gly Giu Lys Lys Glu Val Asn Glu Gly Thr Lys Phe Glu Trp Leu 
220 225 230 

gag ccc cc: cga acc agg gac gec aac aga. aga cgc ccc gac gac ccc 
Glu Ser Ser Arg lie Arg Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro 
235 240 245 250 

etc cac gac aga aag acc cca cac aca cca ccc gat gtt ttc aag aaa 
Leu Tyr Asp Arg Lys Thr Leu His lie Pro Pro Asp Val Phe Lys Lys 
255 260 265 

atg ccc gca cca caa aag caa tac egg age gec aag age gaa cac atg 
Met Ser Ala Ser Gin Lys Gin Tyr Trp Ser Val Lys Ser Glu Tyr Met 
270 275 280 

gac att gcg ctt ttc ttt aaa gtg ggg aaa ttt tat gag ctg cat gag 
Asp He Val Leu Phe Phe Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu 
295 290 295 

eta gac gcg gaa tea ggc cac aag gag ccc gac tgg aag atg acc acg 
Leu Asp Ala Glu Leu Gly His Lys Glu Leu Asp Trp Lys Met Thr Met 
30*0 305 310 

age ggc gcg gga aaa cgc aga cag gee ggc acc tct gaa agt ggg ata 
Ser Gly Val Gly Lys Cys Arg Gin Val Gly He Ser Glu Ser Gly lie 
315 320 325 330 

gat gag gca gcg caa aag cca eta gec cgc gga cac aaa gtc gga cga 
Asp Glu Ala Val Gin Lys Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg 
335 340 345 

acc gag cag cca gaa aca ccc gac caa gca aaa gec aga ggc gec aat 
He Glu Gin Leu Glu Thr Ser Asp Gin Ala Lys Ala Arg Gly Ala Asn 
350 355 360 

act ata acc cca agg aag eta get cag gta tta act cca tea aca gca 1275 
Thr lie lie Pro Arg Lys Leu Val Gin Val Leu Thr Pro Ser Thr Ala 
365 370 375 



age gag gga aac ate ggg cct gat gec gtc cat cct ctt get ata aaa 
Ser Glu Gly Asn He Gly Pro Asp Ala Val His Leu Leu Ala He Lys 
380 385 390 

gag ate aaa atg gag eta caa aag tgt tea act gtg tat gga ttt get 
Glu He Lys Met Glu Leu Gin Lys Cys Ser Thr Val Tyr Gly Phe Ala 
395 400 405 410 

ttt gtt gac tgt get gee ttg agg ttt tgg gtt ggg tec ate age gat 
Phe Val Asp Cys Ala Ala Leu Arg Phe Trp Val Gly Ser lie Ser Asp 
415 420 425 



891 



939 



987 



1035 



1083 



1131 



1179 



1227 



1323 



1371 



1419 
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gac gca cca cgc gcc get etc gga gcg tea ccg acg cag gtt tct cca 

Asp Ala Ser Cys Ala Ala Leu Gly Ala Leu Leu Met Gin Val Ser Pro 
430 435 440 

aag gaa gcg tea cac gac agt aaa ggg cca tea aga gaa gca caa aag 
Lys Glu Val Leu Tyr Asp Ser Lys Gly Leu Ser Arg Glu Ala Gin Lys 
445 450 455 

get cca agg aaa tat acg ttg aca ggg tct acg gcg gta cag ttg get 
Ala Leu Arg Lys Tyr Thr Leu Thr Gly Ser Thr Ala Val Gin Leu Ala 
460 465 470 

cca gta cca caa gta atg ggg gat aca gac get get gga gtt aga aat 
Pro Val Pro Gin Val Met Gly Asp Thr Asp Ala Ala Gly Val Arg Asn 
475 480 485 490 

ata ata gaa tct aac gga tac ttt aaa ggt tct tct gaa tea tgg aac 
lie lie Glu Ser Asn Gly Tyr Phe Lys Gly Ser Ser Glu Ser Trp Asn 
495 500 505 

cgc gcc gcc gat ggc eta aac gaa tgt gac gcc gcc ccc agt -gcc cct 
Cys Ala Val Asp Gly Leu Asr. Glu Cys Asp Val Ala Leu Ser Ala Leu 
510 515 520 

gga gag cca act aat cat ctg tec agg eta aag cca gaa gat "gta ccc 
Gly Glu Leu lie Asn His Leu Ser Arg Leu Lys Leu Glu Asp Val Leu 
525 530 535 

aag cac ggg gat att ttt cca tac caa gtt tac agg ggt cgt etc aga 
Lys His Gly Asp lie Phe Pro Tyr Gin Val Tyr Arg Gly Cys Leu Arg 
540 545 550 

ate gat ggc cag acg atg gta aat etc gag aca ccc aac aat age tgt 
lie Asp Gly Gin Thr Met Val Asn Leu Glu lie Phe Asn Asn Ser Cys 
555 560 565 570 

gat ggt ggt cct tea ggg ace ttg tac aaa cat etc gat aac tgt gtt 
Asp Gly Gly Pro Ser Gly Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val 
575 580 585 

agt cca act ggt aag cga etc tta agg aat tgg ate tgc cat cca etc 
Ser Pro Thr Gly Lys Arg Leu Leu Arg Asn Trp lie Cys His Pro Leu 
590 595 600 

aaa gat gta gaa age ate aat aaa egg ctt gat gta gtt gaa gaa ttc 
Lys Asp Val Glu Ser lie Asn Lys Arg Leu Asp Val Val Glu Glu Phe 
605 610 615 

acg gca aac tea gaa agt atg caa ate act ggc cag tat etc cac aaa 
Thr Ala Asn Ser Glu Ser Met Gin lie Thr Gly Gin Tyr Leu His Lys 
620 625 630 



1467 



1515 



1563 



1611 



1659 



1707 



1755 



1803 



1851 



1899 



1947 



1995 



2043 
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ctt cca gac eta gaa aga ctg etc gga cgc ate aag tec age gtc cga 2091 
Leu Pro Asp Leu Glu Arg Leu Leu Gly Arg lie Lys Ser Ser Val Arg 
635 640 645 650 



tea tea gee tct gtg ttg cct get ctt ctg ggg aaa aaa gtg ctg aaa 2139 
Ser Ser Ala Ser Val Leu Pro Ala Leu Leu Gly Lys Lys Val Leu Lys 
655 660 665 



caa cga gtt aaa gca etc ggg caa att gtg aaa ggg ttc aga agt gga 2187 
Gin Arg Val Lys Ala Phe Gly Gin lie Val Lys Gly Phe Arg Ser Gly 
670 675 680 



att gat ctg ttg ttg get eta cag aag gaa tea aat atg atg agt ttg 223 5 

lie Asp Leu Leu Leu Ala Leu Gin Lys Glu Ser Asn Met Met Ser Leu 

685 690 695 

ctt tat aaa etc tgt aaa ctt cct ata tta gta gga aaa age ggg eta 2283 

Leu Tyr Lys Leu Cys Lys Leu Pro lie Leu Val Gly Lys Ser Gly Leu 

700 705 710 

gag tta ttc ctt tct caa ttc gaa gca gee ata gat age gac cct cca 2331 

Glu Leu Phe Leu Ser- Gin Phe Glu Ala Ala lie Asp Ser Asp Phe Pro 
715 720 725 730 



aat tat cag aac caa gat gtg aca gat gaa aac gc: gaa act etc aca 
Asn Tyr Gin Asn Gin Asp Val Thr Asp Glu Asn Ala Glu Thr Leu Thr 
735 740 745 



2379 



ata ctt ate gaa ctt ttt ate gaa aga gca act caa tgg tct gag gtc 2427 
lie Leu lie Glu Leu Phe lie Glu Arg Ala Thr Gin Trp Ser Glu Val 
750 755 760 

att cac acc ata age tgc eta gat gtc ctg aga cct ttt gca ate gca 2475 
lie His Thr lie Ser Cys Leu Asp Val Leu Arg Ser Phe Ala He Ala 
765 770 775 



gca agt etc tct get gga age atg gee agg cct gtt att ttt ccc gaa 2523 
Ala Ser Leu Ser Ala Gly Ser Met Ala Arg Pro Val He Phe Pro Glu 
780 785 790 

tea gaa get aca gat cag aat cag aaa aca aaa ggg cca ata ctt aaa 2571 
Ser Glu Ala Thr Asp Gin. Asn Gin Lys Thr Lys Gly Pro He Leu Lys 
795 800 805 810 



ate caa gga eta tgg cat cca ttt gca gtt gca gec gat ggt caa ttg 2619 
He Gin Gly Leu Trp His Pro Phe Ala Val Ala Ala Asp Gly Gin Leu 
815 820 825 

cct gtt ccg aat gat ata etc ctt ggc gag get aga aga age agt ggc 2667 
Pro Val Pro Asn Asp He Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly 
830 835 840 
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age act cac cct egg cca ccg :ca ccg acg gga cca aac acg ggc gga 2715 
Ser lie His Pro Arg Ser Leu Leu Leu Thr Gly Pro Asn Met Gly Gly 
845 350 855 

aaa cca acc ccc ccc cgc gca aca cgt ccg gee gec ace ccc gec caa 2763 
Lys Ser Thr Leu Leu Arg Ala Thr Cys Leu Ala Val lie Phe Ala Gin 
860 865 870 

ccc ggc cgc cac gtg ccg cgc gag ccc cgc gaa acc ccc etc gcg gac 2811 
Leu Gly Cys Tyr Val Pro Cys Glu Ser Cys Glu lie Ser Leu Val Asp 
875 880 885 890 

act acc ccc aca agg ccc ggc gca ccc gac aga acc acg aca gga gag 2859 
Thr lie Phe Thr Arg Leu Gly Ala Ser Asp Arg lie Met Thr Gly Glu 
895 900 905 

age acc cct ttg gta gaa tgc acc gag aca gcg cca get ctt cag aat 2907 
Ser Thr Phe Leu Val Glu Cys Thr Glu Thr Ala Ser Val Leu Gin Asn 
910 915 920 

gca acc cag gat tea cca gca acc ccc gac gaa ccg ggc aga gga acc 2955 
Ala Thr Gin Asp Ser Leu Val He Leu Asp Glu Leu Gly Arg Gly Thr 
925 530 935 

age acc ccc gac gga cac gee acc gca cac teg get cct cgc cac ctg 3003 
Ser Thr Phe Asp Gly Tyr Ala lie Ala Tyr Ser Val Phe Arg His Leu 
940 945 950 

gca gag aaa get caa tgt egg acg ccc ccc gca aca cac cac cac ccc 3051 
Val Glu Lys Val Gin Cys Arg Met Leu Phe Ala Thr His Tyr His Pro 
955 960 965 970 

ccc acc aag gaa ccc gcg ccc cac cca cgc gec acc ccg aaa cac acg 3099 
Leu Thr Lys Glu Phe Ala Ser His Pro Arg Val Thr Ser Lys His Met 

975 980 985 

get tgc gca ttc aaa" tea aga tct gat tat caa cca cgt ggt tgt gat 3147 
Ala Cys Ala Phe Lys Ser Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp 
990 995 1000 

caa gac eta gtg ttc ttg tac cgt tta acc gag gga get tgt ccc gag 3195 
Gin Asp Leu Val Phe Leu Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu 
1005 1010 1015 

age tac gga cte caa gtg gca etc atg get gga ata cca aac caa gtg 3243 
Ser Tyr Gly Leu Gin Val Ala Leu Met Ala Gly lie Pro Asn Gin Val 
1020 1025 1030 

gte gaa aca gca tea ggt get get caa gec atg aag aga tea att ggg 3291 
Val Glu Thr Ala Ser Gly Ala Ala Gin Ala Met Lys Arg Ser lie Gly 
1035 1040 1045 1050 
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gga aac tec aag tea age gag c:a aga ccc gag tec cca age ctg cat 3339 

Glu Asn Phe Lys Ser Ser Glu Leu Arg Ser Glu Phe Ser Ser Leu His 
1055 1060 1065 

gaa gac tgg cue aag tea ttg gcg ggt ate tct cga gee gec cac aac 3387 

Glu Asp Trp Leu Lys Ser Leu Val Gly He Ser Arg Val Ala His Asn 
1070 1075 1080 

aat gec ccc aec ggc gaa gac gac eac gac act teg ett egc eta tgg 3435 

Asn Ala Pro He Gly Glu Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp 
1085 1090 1095 

cae gag aec aaa tec tee tac cge gee ccc aaa taaatggcta 3478 
His Glu He Lys Ser Ser Tyr Cys Val Pro Lys 
1100 1105 

egacataaca ctatctgaag cecgeeaage ccceegccec ecegaegccc aetcceceea 3538 

aaaaatgett acataccaaa aaatcg:::c cccgactaaa aaaaaaaaaa aaaaaaaaaa 3598 

aaaaaaaa 360 6 



<210> 31 

<211> 1109 

<212> PRT 

<213> Arabldopsis thai iana ecctype Columbia 

<223> Polypeptide MSH6 

<400> 31 

Met Gin Arg Gin Arg Ser He Leu Ser Phe Phe Gin Lys Pro Thr Ala 
15 10 15 

Ala Thr Thr Lys Gly Leu Val Ser Gly Asp Ala Ala Ser Gly Gly Gly 
20 25 30 

Gly Ser Gly Gly Pro Arg Phe Asn Val Arg Glu Gly Asp Ala Lys Gly 
35 40 45 

Asp Ala Ser Val Arg Phe Ala Val Ser Lys Ser Val Asp Glu Val Arg 
50 55 60 

Gly Thr Asp Thr Pro Pro Glu Lys Val Pro Arg Arg Val Leu Pro Ser 
65 70 75 80 

Gly Phe Lys Pro Ala Glu Ser Ala Gly Asp Ala Ser Ser Leu Phe Ser 

85 90 95 

Asn lie Mee His Lys Phe Val Lys Val Asp Asp Arg Asp Cys Ser Gly 
100 105 110 

Glu Arg Ser Arg Glu Asp Val Val Pro Leu Asn Asp Ser Ser Leu Cys 
115 120 125 
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Mec Lys Ala Asn Asp Val He Pro Gin Phe Arg Ser Asn Asn Gly Lys 
130 135 140 

Thr Gin Glu Arg Asn His Ala Phe Ser Phe Ser Gly Arg Ala Glu Leu 
145 150 155 160 

Arg Ser Val Glu Asp He Gly Val Asp Gly Asp Val Pro Gly Pro Glu 

165 170 175 

Thr Pro Gly Met Arg Pro Arg Ala Ser Arg Leu Lys Arg Val Leu Glu 
180 185 190 

Asp Glu Met Thr Phe Lys Glu Asp Lys Val Pro Val Leu Asp Ser Asn 
195 200 205 

Lys Arg Leu Lys Met Leu Gin Asp Pro Val Cys Gly Glu Lys Lys Glu 
210 215 220 

Val Asn Glu Gly Thr Lys Phe Glu Trp Leu Glu Ser Ser Arg He Arg 
225 230 235 240 

Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro Leu Tyr Asp Arg Lys Thr 

245 250 255 

Leu His He Pro Pro Asp Val Phe Lys Lys Met Ser Ala Ser Gin Lys 
260 265 270 

Gin Tyr Trp Ser Val Lys Ser Glu Tyr Met Asp He Val Leu Phe Phe 
275 280 285 

Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu Leu Asp Ala Glu Leu Gly 
290 295 300 

His Lys Glu Leu Asp Trp Lys Met Thr Met Ser Gly Val Gly Lys Cys 
305 .310 315 320 

Arg Gin Val Gly He Ser Glu Ser Gly He Asp Glu Ala Val Gin Lys 

325 330 335 

Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg He Glu Gin Leu Glu Thr 
340 345 350 

Ser Asp Gin Ala Lys Ala Arg Gly Ala Asn Thr He He Pro Arg Lys 
355 360 365 

Leu Val Gin Val Leu Thr Pro Ser Thr Ala Ser Glu Gly Asn He Gly 
370 375 380 

Pro Asp Ala Val His Leu Leu Ala He Lys Glu lie Lys Met Glu Leu 
385 390 395 400 

Gin Lys Cys Ser Thr Val Tyr Gly Phe Ala Phe Val Asp Cys Ala Ala 
405 410 415 
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Leu Arg Phe Trp Val Gly Ser lie Ser Asp Asp Ala Ser Cys Ala Ala 
420 425 430 

Leu Gly Ala Leu Leu Met Gin Val Ser Pro Lys Glu Val Leu Tyr Asp 
435 440 445 

Ser Lys Gly Leu Ser Arg Glu Ala Gin Lys Ala Leu Arg Lys Tyr Thr 
450 455 460 

Leu Thr Gly Ser Thr Ala Val Gin Leu Ala Pro Val Pro Gin Val Met 
465 470 475 480 

Gly Asp Thr Asp Ala Ala Gly Val Arg Asn lie lie Glu Ser Asn Gly 
485 490 495 

Tyr Phe Lys Gly Ser Ser Glu Ser Trp Asn Cys Ala Val Asp Gly Leu 
500 505 510 

Asn Glu Cys Asp Val Ala Leu Ser Ala Leu Gly Glu Leu lie Asn His 
515 520 525 

Leu Ser Arg Leu Lys Leu Glu Asp Val Leu Lys His Gly Asp lie Phe 
530 535 540 

Pro Tyr Gin Val Tyr Arg Gly Cys Leu Arg lie Asp Gly Gin Thr Met 
545 550 555 560 

Val Asn Leu Glu lie Phe Asn Asn Ser Cys Asp Gly Gly Pro Ser Gly 
565 570 575 

Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val Ser Pro Thr Gly Lys Arg 
580 585 590 

Leu Leu Arg Asn Trp lie Cys His Pro Leu Lys Asp Val Glu Ser lie 
595 600 605 

Asn Lys Arg Leu Asp Val Val Glu Glu Phe Thr Ala Asn Ser Glu Ser 
610 615 620 

Met Gin lie Thr Gly Gin Tyr Leu His Lys Leu Pro Asp Leu Glu Arg 
625 630 635 640 

Leu Leu Gly Arg lie Lys Ser Ser Val Arg Ser Ser Ala Ser Val Leu 
645 650 655 

Pro Ala Leu Leu Gly Lys Lys Val Leu Lys Gin Arg Val Lys Ala Phe 
660 665 670 

Gly Gin lie Val Lys Gly Phe Arg Ser Gly lie Asp Leu Leu Leu Ala 
675 680 685 

Leu Gin Lys Glu Ser Asn Met Met Ser Leu Leu Tyr Lys Leu Cys Lys 
690 695 700 
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Leu Pro lie Leu Val Gly Lys Ser Gly Leu Glu Leu Phe Leu Ser Gin 
705 710 715 720 

Phe Glu Ala Ala lie Asp Ser Asp Phe Pro Asn Tyr Gin Asn Gin Asp 
725 730 735 

Val Thr Asp Glu Asn Ala Glu Thr Leu Thr lie Leu lie Glu Leu Phe 
740 745 750 

lie Glu Arg Ala Thr Gin Trp Ser Glu Val He His Thr He Ser Cys 
755 760 765 

Leu Asp Val Leu Arg Ser Phe Ala He Ala Ala Ser Leu Ser Ala Gly 
770 775 780 

Ser Met Ala Arg Pro Val He Phe Pro Glu Ser Glu Ala Thr Asp Gin 
785 790 795 800 

Asn Gin Lys Thr Lys Gly Pro He Leu Lys He Gin Gly Leu Trp His 
805 810 815 

Pro Phe Ala Val Ala Ala Asp Gly Gin Leu Pro Val Pro Asn Asp He 
820 825 830 

Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly Ser He His Pro Arg Ser 
835 840 845 

Leu Leu Leu Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Leu Leu Arg 
850 855 860 

Ala Thr Cys Leu Ala Val He Phe Ala Gin Leu Gly Cys Tyr Val Pro 
865 870 875 880 

Cys Glu Ser Cys Glu He Ser Leu Val Asp Thr He Phe Thr Arg Leu 
885. 890 895 

Gly Ala Ser Asp Arg He Met Thr Gly Glu Ser Thr Phe Leu Val Glu 
900 905 910 

Cys Thr Glu Thr Ala Ser Val Leu Gin Asn Ala Thr Gin Asp Ser Leu 
915 920 925 

Val He Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr Phe Asp Gly Tyr 
930 935 940 

Ala He Ala Tyr Ser Val Phe Arg His Leu Val Glu Lys Val Gin Cys 
945 950 955 960 

Arg Met Leu Phe Ala Thr His Tyr His Pro Leu Thr Lys Glu Phe Ala 
965 970 975 

Ser His Pro Arg Val Thr Ser Lys His Met Ala Cys Ala Phe Lys Ser 
980 985 990 
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Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp Gin Asp Leu Val Phe Leu 
995 1000 1005 

Tyr Arg Leu Thr Glu Gly Ala Cys Pro Giu Ser Tyr Gly Leu Gin Val 
1010 1015 1020 

Ala Leu Met Ala Gly lie Pro Asn Gin Val Val Glu Thr Ala Ser Gly 
1025 1030 ' 1035 1040 

-Ala Ala Gin Ala Met Lys Arg Ser He Gly Glu Asn Phe Lys Ser Ser 
1045 1050 1055 

Glu Leu Arg Ser Glu Phe Ser Ser Leu His Glu Asp Trp Leu Lys Ser 
1060 1065 1070 

Leu Val Gly He Ser Arg Val Ala His Asn Asn Ala Pro He Gly Glu 
1075 1080 1085 

Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp His Glu He Lys Ser Ser 
1090 * 1095 HOO 

Tyr Cys Val Pro Lys 
1105 



<210> 32 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<2 23> Forward primer for PCR amplification of ATHGENEA 
microsatellite 

<400> 32 
accatgcata gcttaaac'tt cttg 



24 



<210> 33 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of ATHGENEA 
microsatellite 

<400> 33 



acataaccac aaataggggt gc 
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<210> 34 
<211> 18 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Forward primer DMCIN-A for PCR on genomic DNA of Arabidopsis 

thaliana ssp. Landsberg erecta "Ler" 

<400> 34 

gaagcgatat tgttcgtg 18 



<210> 35 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMCIN-B for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 

<400> 35 

agattgcgac aacactcc - 18 

<210> 36 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer DMCIN-1 for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 

<400> 36 

acgcgtcgac tcagctatga gattactcgt g 31 

<210> 37 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMCIN-2 for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 

<400> 37 

gctctagatt tctcgctcta agactctct 29 
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<210> 38 

<2ll> 32 

<212> DMA 

<213> Artificial sequence 
<220> 

<223> Forward primer DMCIN-3 for PCR on genomic DNA of Arabidopsis 
chaliana ssp. Landsberg erecta "Ler" 

<400> 38 

gctctagagc ttctcttaag taagtgattg at 32 

<210> 39 

<211> 48 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMCIN- 4 for PCR on genomic DNA of Arabidopsis 
:haliana ssp. Landsberg erecta "Ler" 

<400> 39 

tcccccgggc tcgagagatc tccatggttt cttcagctct atgaatcc 4 8 

<210> 40 

<211> 26 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer DMCla for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 

<400> 40 

acgcgtcgac gaattcgcaa gtgggg 26 



<2X0> 41 

<211> 38 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMClb for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 

<400> 41 
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cccacggaga ccccccgggt accgacccgc ttcgaggg 38 

<210> 42 

<211> 20 

<212> DNA 

<213 > Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of ATEAT1 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 42 

gccactgcgt gaatgatatg 20 



<210> 43 

<211> 22 

<212> DNA 

<2 13 > Artificial sequence 
*<220> 

<223> Reverse primer for PCR amplification of ATEAT1 SSLP marker in 
Arabidopsis thaliana subspecies 

c400> 43 

cgaacagcca acattaattc cc 22 



<210> 44 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA63 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 44 

aaccaaggca cagaagcg 1 8 



<210> 45 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA63 SSLP marker in 



Arabidopsis thaliana subspecies 
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<400> 45 

acccaagtga ccgccacc 18 

<210> 46 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA248 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 46 

taccgaacca aaacacaaag g 21 

<210> 47 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA24 8 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 47 

tctgtatctc ggtgaattct cc 22 

<210> 48 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA128 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 48 

ggtctgttga tgtcgtaagt eg 2 2 



<210> 49 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 
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<223> 


Reverse primer for PCR amplif ication 
Arabidopsis thaiiana subspecies 


of 


NGA12 8 


SSLP 


marker in 


<400> 


49 










atcttgaaac 


cctcagggag gg 








22 


<210> 


50 










<211> 


22 










<212> 


DNA 










<213> 


Art i f iri a 1 er£»rmorir"e» 










^ a ^ \j ^ 












<223> 


Forward primer for PCR amplification 
Arabidopsis chaliana subspecies 


of 


NGA2 8 0 


SSLP 


marker in 


<400> 


50 










ctgatctcac 


ggacaatagt gc 








22 


<210> 


51 










<211> 


20 










<212> 


DNA 












Artificial sequence 






















<223> 


Reverse primer for PCR amplification 
Arabidopsis thaiiana subspecies 


of 


NGA2 80 


SSLP 


marker in 


<400> 


51 










ggctccataa 


aaagtgcacc 








20 


<210> 


52 










<211> 


21 










<212> 


DNA 











<213> Artificial sequence 

c220> 

<223> Forward primer for PCR amplification of NGA111 SSLP marker in 

Arabidopsis thaiiana subspecies 

<400> 52 

ctccagttgg aagctaaagg g 21 

<210> 53 

<211> 21 

<212> DNA 

<213> Artificial seauence 
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<222> Reverse primer for PCR amplification of NGA111 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 53 

tgttttttag gacaaatggc g 21 

<210> 54 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 54 

cccccacatc caaaacccac 20 

<210> 55 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 55 

gcacataccc acaaccagaa 2 0 

<210> 56 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA1126 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 56 

cgctacgctt ttcggtaaag 20 
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<210> 


57 






<211> 


20 






<212 > 


DNA 






<213> 


Arc if icial sequence 






<220> 








<223> 


Reverse primer for PCR amplification 
XLi ciu xuLuus x s Liidiiana subspecies 


of 


NGA112 6 SSLP marker 


<400> 


57 






gcacagccca 


agccacaacc 




20 


<210> 


58 






<211> 


20 






<212> 


DNA 






<213> 


Artificial sequence 






<220> 








<223> 


Forward primer for PCR amplification 
Arabidopsis thaiiana subspecies 


of 


NGA3 61 SSLP marker in 




o o 






aaagagacga 


gaatttggac 




20 


<210> 


59 






<211> 


23 






<212> 


DNA 






<213> 


Artificial sequence 






<220> 








<223> 


Reverse primer for PCR amplification 
Arabidopsis thaiiana subspecies 


of 


NGA3 61 SSLP marker in 


<400> 


59 






acatatcaat 


atattaaagt age 




23 



<210> 60 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis thaiiana subspecies 

<400> 60 
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PCT/EP98/06977 



18 



<210> 61 

<211> 22 

<212> DNA 

<213> Arcificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 61 

gaggacatgt ataggagcct eg 22 

<210> 62 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of AthBI02 SSL? marker 
in Arabidopsis thaliana subspecies 

<400> 62 

tgacctcctc ttccatggag 20 

<210> 63 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of AthBI02 SSL? marker 
in Arabidopsis thaliana subspecies 

<400> 63 

ttaacagaaa cccaaagctt tc 2 2 



<210> 64 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of AthUBIQUE SSLP marker 
in Arabidopsis thaliana subspecies 
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4 1 






<400> 


64 




- 


aggcaaacgc 


ccatttcacc g 




21 


<210> 


65 






<211> 


20 






<2X2> 


DNA 






^ X J 








*> *> f\ -s. 

< ^ ^ u > 








<223> 


Reverse primer for PCR amplification 
in Arabidopsis chaliana subspecies 


of 


AthUBIQUE SSLP marker 


<400> 


65 






acgacacggc 


agatctctcc 




20 


<210> 


66 






<21X> 


21 






<212> 


DNA 






< 213 > 








— *^ /\ 
<220> 








<223> 


Forward primer for PCR amplification 
Arabidopsis chaliana subspecies 


of 


NGA172 SSLP marker in 


<400> 


66 






agctgctccc 


ctatagcgcc c 




21 


<210> 


67 






<211> 


19 






<212> 


DNA 






<213> 


niulilulal SeuuenCc 






<220> 








<223> 


Reverse primer for PCR amplification 
Arabidopsis thaliana subspecies 


of 


NGA172 SSLP marker in 


<400> 


67 







catccgaatg ccattgttc 19 

<210> 68 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 
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<223> Forward primer for PCR amplif icac ion of NGA126 SSLP marker in 

Arabidopsis chaliana subspecies 

•;400> S3 

gaaaaaacgc taccttcgcg g 21 



<210> 69 

<211> 22 

<212> DMA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA126 SSLP marker in 
Arabidopsis chaliana subspecies 

<400> 69 

caagagcaat atcaagagca gc 22 



<210> 70 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplif icac ion of NGA162 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 70 

cacgcaattt gcatctgagg 20 



<210> 71 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA162 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 71 

ctctgtcact cttttcctct gg 22 



<210> 72 

<211> 21 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Forward primer for PCR amplif icacion of NGA6 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 72 

tggatttctt cccctcttca c 21 

<210> 73 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA6 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 73 

atggagaagc tcacactgat c 21 

<210> 74 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA12 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 74 

aatgttgtcc tcccctcctc 20 

<210> 75 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA12 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 75 

tgatgctctc tgaaacaaga gc 22 
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<210> 76 

<211> 21 " 

<212> DNA 

<213> Arcificial sequence 

<220> 

<223> Forward primer for PCR amplification of NGA8 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 76 

gagggcaaac ctccactccg g 21 

<210> 77 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA8 SSLP marker in 
Arabidopsis ehaliana subspecies 

<400> 77 

tggctttcgt ttataaacat cc 22 

<210> 78 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA1107 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 78 

gcgaaaaaac aaaaaaatcc a 21 



<210> 79 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA1107 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 79 



NSDOCIO. <WO 9919492A2 1 > 



WO 99/19492 



PCT/EP98/06977 



45 



cgacgaaccg 


acagaattag g 








<L 1 


<210> 


80 










<211> 


21 










<212> 


DMA 










<213> 


Art i f i c i a 1 sequence 










<220> 












<223> 


Forward primer for PCR amplification 
Arabidopsis thaliana subspecies 


of 


NGA225 


SSLP 


marker in 


<400> 


80 










gaaacccaaa 


tcccagagag g 








2 1 


<210> 


81 










<211> 


22 










<212> 


DNA 










<213> 


Artificial ! t ?pmipn'"*p 










<220> 












<223> 


Reverse primer for PCR amplification 
Arabidopsis thaliana subspecies 


of 


NGA225 


SSLP 


marker in 


<400> 


81 










tctccccacc 


agttttgtgt cc 










*210> 


82 










<211> 


19 










<212> 


DNA 










<213> 


Artificial sequence 






















<223> 


Forward primer for PCR amplification 
Arabidopsis thaliana subspecies 


of 


NGA24 9 


SSLP 


marker in 


<400> 


82 










taccgtcaat 


ttcatcgcc 








19 



<210> 83 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA24 9 SSLP marker in 
Arabidopsis thaliana subspecies 
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<40C> 83 

ggatccctaa ctgtaaaacc cc 22 

<210> 84 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of CA72 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 84 

aaccccagta accaaacaca ca 22 

<210> 85 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of CA72 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 85 

cccagtctaa ccacgaccac 20 

<210> 86 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA151 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 86 

gttttgggaa gttttgctgg 20 

<210> 87 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 
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Reverse primer for PCR amplif icacion of 
Arabidopsis chaliana subspecies 


NGA151 


SSLP 


marker in 


< 4 0 0 > 


87 








cagcctaaaa 


gcgagagtat gatg 






24 


<210> 


88 








<211> 


22 








<212> 


DNA 








<213> 


Artificial sequence 








<220> 










< J > 


Forward primer for PCR amplification of 
Arabidopsis thaliana subspecies 


NGA106 


SSLP 


marker in 


<400 > 


88 








gttatggagt 


ctctagggca eg 






22 


<210> 


89 








<211> 


20 








<212> 


DNA 








<213> 


Art i f i c ia 1 sequence 








<220> 










_ —> _ 
<22J > 


Reverse primer for PCR amplification of 
Arabidopsis thaliana subspecies 


NGA106 


SSLP 


marker in 


< 4 0 0 > 


8 9 








tgccccatt: 


cgttcttctc 






20 


<210> 


90 








<211> 


20. 








<212> 


DNA 








c213> 


Artificial sequence 








<220> 










<223> 


Forward primer for PCR amplification of 
Arabidopsis thaliana subspecies 


NGA13 9 


SSLP 


marker in 


<400> 


90 








agagctacca 


gatccgatgg 






20 



<210> 91 

<211> 21 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Reverse primer for PCR amplification of NGA13 9 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 91 

ggtttcgttt cactatccag g 21 

<210> 92 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA76 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 92 

ggagaaaatg tcaccctcca cc 22 

<210> 93 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA76 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 93 

aggcatggga gacatttacg 2 0 

<210> 94 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of ATHS0191 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 94 

ctccaccaat catgcaaatg 2 0 
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<210> 95 
<211> 21 
<212> DMA 

<213> Artificial sequence 

<220> 

<223> Reverse primer for PGR amplification of ATHS0191 SSLP marker 

in Arabidopsis thaliana subspecies 

<400> 95 

tgatgttgat ggagatggtc a 21 



<210> 96 

<211> 22 

<212> DNA 

<213> Artificial sequence 



Forward primer for PCR amplification of NGA129 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 96 

tcaggaggaa ctaaagtgag gg 22 



<220> 
<223> 



<210> 97 

<211> 22 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Reverse primer for PCR amplification of NGA129 SSLP marker in 





Arabidopsis 


thaliana 


subspecies 


<400> 


97 






cacactgaag 


atggtcttga gg 






<210> 
<211> 
<212> 
<213> 


98 

8062 
DNA 

Arabidopsis 


thaliana 


ecotype Columbia 


<220> 
<223> 


Genomic DNA 


sequence 


of AtMSH6 


<400> 


97 







"ttttggtt gctaacaata aaggtatacg gttttatgtc atcaatataa ctatatataa 6 0 
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aagaaatgaa agatatatac tgttcttcca tctatcaaac aaaacaacaa gacttcxttt 120 

ttaccct:ca caccggtcaa caaaacacaa gacaaacgac arcgcctaac catctcccaa 180 

ccccacccct aagtttaaca cctagaacct tccccaccct cgcaagcaca gcctgattag 240 

gaacagccct accattctca tactcctgaa ctacctgagt cctcccaccg acctgtttcg 300 

ccaaacccgc ccgcgacacc ttcttctcca atctcgcttt ccgcatcatc aacctcacct 360 

ctgctttcac acgatccatc gccgcaggct ctgtttcttc ccccagcttc ttcgtgttaa 420 

tcaccggaac cgccgtagat cccccctttt tgttcgaacc ggcaccgaac ttcttaaccg 480 

ttcgaaccgc gacaccgtcc cccagagctg cgctaaccgc ccccggatcg cgcaggtctt 540 

ggctctcttg ttctgacttg tggagaacea ctggtcccca gccctgtgtc actgctcctg 600 

ggtatccgct cggcaccgcc gacgaaccga gagaaaggaa caacgcgaaa atcctaccaa 660 

tctgagtctc gaaattgaga aacgacgaag acgaagaacg czgctgagag gactgtgata 720 

tctacacata cgaagattgg tccctggaga acccgaccat c:z:ccctcc atcttcgcct 780 

ctggaacgtc cncagagatg actgacgacg tgtcactatc cgatctgcag ccaaccaacg 840 

cctcccgggc tggattcgtg gcacaccaca ttatccgatt cggctcaatg gctccacata 900 

aatctggccc tcggctcggc tatgagtcat catcaaaact aagccaacca aaaatttccg 960 

taaaacttat ttcggtttca actcggaccc cttaccccca gaaccgaatt acccgaaacc 1020 

ggggttagcc gaaccgaata ccaacgcctg attgactcgc cggccagaaa gatccaacgg 1080 

catacaacaa tagaacataa atcggacggt caccaaagcc ccaaagagcg aacagtcaac 1140 

aaaaaaagtc gagccccgag gagcatcgtt tccgccactt ctacgacgca aggcgaaaat 1200 

ttttggcgcc aatctttccc ccctttcgaa ttctctcagc ccaaaacatc gtttctctct 1260 

caccccccct cacaattcca aaaaacgcag cgccagagat cgaccctgcc tttcttccaa 1320 

aaacccacgg cggcgactac gaagggtttg gtttccggcg acgccgctag cggcgggggc 1380 

ggcagcggag accacgattt aatgtgaagg aaggggatgc caaaggcgac gcctctgcac 1440 

gttttgctgt ttcgaaatct gtcgatgagg ctagaggaac ggacactcca ccggagaagg 15O0 

ttccgcgtcg tgtcctgccg tctggattta agccggccga atccgccggt gacgcttcgt 1560 

ccccgctctc caatattatg cataagtttg taaaagccga tgatcgagat cgttctggag 1620 

agaggtacta accttcgatt ctctcaattt tgtcatcttt agctggaaga agaagattcg 1680 
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cgtaatccgc cgtactcgtc ggagagattc 
ccttcaggag ccgagaagac gttgtcccgc 
acgacgccac ccctcaaccc cgctccaaca 
tcagtctcag tgggagagct gaacttagat 
ctcctggccc agaaacacca gggatgcgtc 
aggacgaaac gacttttaag gaggataagg 
aaatgctcca ggatccggtc tgtggagaga 
aacggcctga gccttctcga atcagggacg 
acgatagaaa gaccttacac ataccacctg 
agcaataccg gagtgttaag agcgaacata 
gtaactatta acccagcgcc caatccatct 
ccacgccacg ccccccccag gggaaacctc 
gtcacaagga gcctgactgg aagacgacca 
attagttgaa acaactggcc egcttgaatt 
tgccccaggt tggtatctcc gaaagtggga 
gcgggcaagg gaaccatcat acttcatgga 
aagaaacgga aaccacttca agcatcatta 
ctattcgtta tataataagt cttttcatca 
tgtgtgctca ccggcaaggc atatataccc 
agttacggat aacatatgtt tacttgtatg 
cttcaagcct tgcgttgaaa ctggatagtt 
gattcatatg cttccctatt ctactaatta 
agatataaag ttggacgaat cgagcagcta 
gctaataccg caagctttcc tggacaggtc 
ttcctttttc cgtacattac tttcacgctg 
tatcagataa tcccaaggaa gctagctcag 
aacatcgggc ctgatgccgt ccatcctctt 
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cgattaccgc attggatcgt cgcctacaaa 174 0 

tgaacgatcc atctccatgt acgaaggcca 1800 

atggcaaaac tcaagaaaga aaccatgctt 186 0 

cagcagaaga tataggagta gacggcgacg 192 0 

cacgtgcttc tcgcctgaag cgagttctgg 198 0 

ttcctgtatt ggactccaac aaaaggctga 204 0 

agaaagaagt aaacgaagga accaaattcg 2100 

ccaatagaag acgccctgac gatccccttt 2160 

atgttttcaa gaaaacgtct gcaccacaaa 222 0 

tggacattgc gcttcccctt aaagcggcca 2 2 80 

ccccaatgcg atctgttcac ttacatctgt 2340 

atgagctgta tgagccagat gcggaactag 24 00 

tgagtggcgt gggaaaacgc agacaggtaa 24 60 

actgtgtcta taaacctcga caccaccttt 2520 

cagatgaggc agtgcaaaag ccattagccc 2 5 80 

attcgtttac tgccacctcg gctaggattc 2640 

gccaggatcc tgagaactca ggatgctctc 27 00 

aggagcaaca aacaaaactt gcacaatatt 2760 

agctaacctt tgccagttca ctgtagcaac 2820 

tggtaccccc attttgtctc tcatggaggc 2880 

acatatgctt ccaacagaaa ctagcatgca 2 940 

tgtattgaca cactcgttgt ttcttctgaa 3 0O0 

gaaacatctg accaagcaaa agccagaggt 3 060 

aaggagagtg ttgcagactg tctttgatca 3120 

taatcaactc aatggctatt ctggtctgat 3180 

gtattaaccc catcaacagc aagcgaggga 324 0 

gctataaaag aggttcgtta tctacttact 33O0 



WO 99/19492 

cacc::atca tgttcagttc atccaagccc 
ccaccaagcc gngtaaagga cttggaatta 
tgcaagaggc ccccttgaaa agatctgtcc 
tttcaagtga atcccctact ttcctacaat 
tatcattcct tgttaccgca tccccctctt 
tctcgtgtgc tggagaaaac gatagctgat 
tgctcaggaa ttgccctcgg ctacattgcc 
ccttcatttc agatcaaaat ggagctacaa 
gttgactgtg ctgccttgag gttctgggtt 
gctcctggag cgttattgat gcaggcaagc 
tgactccctg tgcatatacc cgggtcgcag 
ttttcccagg tttctccaaa ggaagcgcca 
ccagccgttt tgttaaacag aatctaaggt 
atgttgaaat accgcagggc caccaagaga 
gacaggtacc atttcagtag gcaagccaac 
tctctcaaac accgctaatg tagatgacgc 
gtacagttgg ctccagtacc acaagtaacg 
acaatagaac ctaacggata ctttaaaggt 
ggtccaaatg aatgtgatgt tgcccctagt 
aggctaaagg tgtgttggcc tgtttagctc 
tccataactt acagtttcta tctacttgca 
ttttccatac caagtttaca ggggtcgtcc 
tgagatacct aacaatagct gtgatggtgg 
ataacctcaa ctagagggca gacatagaag 
agcaagtaat agccgatttt tgtttaccta 
ctgtgttagt ccaactggca agcgactcct 
tgtagaaagc atcaataaac ggcttgatgt 
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cgaaaaatta cacccctctt taccaa'tctt 3360 

gaaaaccatt acttgatgct ttgttttata 3420 

aagacccttt gcacccgaaa aattcaatct 3480 

gaccatagtc tgcaattgca tgtcaagtaa 3S40 

tcttaatgac cattgtctat gttgtgtttg 3600 

ccaagctgta cattatcatg attaagtagc 3660 

taatggtttg atgtcaattt ttcttccgaa 3720 

aagtgttcaa ctgtgtatgg atttgctttt 3780 

gggtccatca gcgatgatgc atcatgtgct 3840 

aagcgcatcc tgcatcttat gtgtaccacg 3900 

gaaccaaccc cgaatcacca tttggtatgt 3960 

tacgacagea aaggcaaacc gctcgtaccg 4020 

aaatgacact ggccaattta aagtgcacac 4080 

agcacaaaag gccctaagga aatatacgtc 4140 

tgacaatcea accgctcacc gaatgatagg 42 00 

ttatgtccca atctaatagg gtctacggcg 4 „?60 

ggggacacag acgccgctgg agttagaaat 43 2 0 

ccccccgaat cacggaactg tgctgccgat 43 80 

gctcttggag agctaattaa tcatctgtct 4440 

ttgctxttca caaactaagc aaaggaactt 4 500 

gccagaagac gtacttaagc atggggatat 4560 

cagaattgat ggccagacga tggtaaatct 4620 

ccccccaggc aagtgcatat ttcttttctg 4680 

gaaaaattcc aacacttcgt acggatctcc 474 0 

tgtagggacc ttgtacaaat atcttgataa 48O0 

aaggaattgg acccgccatc cactcaaaga 486 0 

agttgaagaa ttcacggcaa actcagaaag 4 92 0 
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catgcaaacc 


actggccagc 


atctccacaa 


acttccagac 


tcagaaagac 


cgctcggacg 


4980 


caccaagcct 


agcgctcgat 


catcagcctc 


tgtgttgcct 


gctcttctgg 


ggaaaaaagc 


5040 


gccgaaacaa 


cgagtaagca 


ccaatcacaa 


gtcttctgag 


caacgccccc 


cacgagtagc 


5100 


ataggactaa 


aacattacgg 


gtctagctaa 


agactgttct 


ccttctcttg 


caatgtctgg 


5160 


ttattcatta 


cacttccctt 


aacctattgc 


attgcaggtt 


aaagcatttg 


ggcaaattgt 


5220 


gaaagggttc 


agaagtggaa 


ctgatctgcc 


gttggctcta 


cagaaggaat 


caaatatgat 


5280 


gagtttgctt 


tacaaactct 


gtaaacttcc 


tacattagca 


ggaaaaagcg 


ggctagagtt 


5340 


acttctttct 


caattcgaag 


cagccataga 


tagcgactct 


ccaaattatc 


aggcgcccat 


5400 


ctatctttca 


cactttacaa 


caaaatgtct 


gtcactactc 


aaagcaatgc 


atatggctta 


5460 


gatctcaact 


cacaccccga 


ggatcctaaa 


gggacctgcc 


ttttattcct 


aacgtttttg 


5520 


gatggtttga 


ctcacttcca 


acttgaactt 


actaaccttg 


taccagaacc 


aagatgtgac 


5580 


agatgaaaac 


gctgaaactc 


tcacaatact 


taccgaaccc 


tctatcgaaa 


gagcaaccca 


5640 


atggcctgag 


gtcactcaca 


ccataagctg 


cctagatgcc 


ccgagatctc 


ctgcaaccgc 


5700 


agcaagcctc 


tctgctggaa 


gcatggccag 


gcccgttatt 


tttcccgaat 


cagaagctac 


5760 


agatcagaat 


cagaaaacaa 


aagggccaat 


acttaaaatc 


caaggaccat 


ggcacccatt 


5820 


tgcagttgca 


gccgatggcc 


aattgcctgc 


cccgaacgat 


atactccttg 


gcgaggctag 


5880 


aagaagcagc 


ggcagcatcc 


atccccggtc 


attgttactg 


acgggaccaa 


acatgggcgg 


5940 


aaaatcaact 


cctctccgtg 


caacacgtct 


ggccgttacc 


tttgcccaag 


tttgtatact 


6000 


cgttagacaa 


ttactccatt 


ctttgcaatc 


agttcttcaa 


catgaataac 


aaattctgtt 


6060 


ttctgtctgc 


agctcggctg 


ccacgtgccg 
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caagttttgt 
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gaactattta 
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6300 


acgcaactca 


ggattcacta 


gtaatccttg 


acgaactggg 


cagaggaact 
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ggaactcgcg tctcacccac gtgtcacctc gaaacacatg gcttgcgcac ccaaatcaag 6600 

accngatcat caaccacgcg gccgcgatca agacccagtg ctcccgcacc gtccaaccga. 6660 

gggagctcgt cctgagagct acggacttca agtggcactc atggctggaa caccaaacca 6720 

agtggttgaa acagcatcag gtgctgctca agccatgaag agatcaattg gggaaaactt 6780 

caagtcaagt gagctaagat ctgagttctc aagtctgcat gaagactggc ccaagtcatt 6840 

ggtgggcatt tctcgagtcg cccacaacaa tgcccccatt ggcgaagatg actacgacac 6900 

tttgtcttgc ttatggcatg agaccaaatc ctcttactgt gttcccaaat aaatggctat 6960 

gacataacac tatctgaagc tcgctaagtc ttttgctcct ccgatgttta ttcctcttaa 7020 

aaaatgccta tataccaaaa aattgtctcc tcgactataa caagattata tatgtatctg 7080 

ccggttcagc tatggtatat aacatatgta tgttcatgag attggccaag agaaatactc 7140 

acaaacagta cattaagaag gaaacatgtt tacgcatcaa ttcaagtttc aagacaaact 7200 

gcaaataacc tcgaccaaag tcgcaaagac caaacacaaa ctacaaaacc tataagactt 726 0 

aagctctgaa ttccctaaaa ccaaaaaaaa aaacagaaca catcttgctg catctacaaa 7320 

caacacaaac ctacatagtt 'tataacttac tcaccactga gactaacatc agaatcattc 7380 

tccatctctc catcttcact ctcatcacca tcaccaccac catgacgatc cccccccccc 7440 

ccacgtaacc tagcaatccc actctgagct ctatcaacaa cccgcccctt ctgcaactcc 7500 

aaatcccccc gaaaatcagc ccccaccctc cccaactcct ccacccgctc tctcttactc 7560 

ctctccatcc tctcataaac cttcccaaac ctctcaacag aatccgccaa caccccatac 7620 

gaagcagcgt cattaacctt cttcctctcg tactcaacct catcatcctc atcctcctcc 7680 

tcttcagaat caccaggact atccatcatc tcatcaaacc caccagactt atctaaataa 774 0 

accttagtgt tcacaaacac aaactcacct gaaccaacac cacaagctaa acctaaatcc 7800 

gacttgggcg aaacacaaag caacatatcc aacttattga aaaacgacca tttacttgaa 786 0 

cctaaacctg atttctcaac cccaatcttc tcttttccat acttcctctt caagccatca 7920 

atcactctcc tacatcgcgc ctcagatttc tccatcctta gctcctcact cactttctca 7980 

gctacttcat tccaatcctc gctcctcaaa ctccttctac ccaattgcaa aaacctatct 804 0 
ccccaaactt caagcaacac aa 80 6 2 
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TrrTTT<;0?TCCTW.CARTAAA<:G7ATACGGTTTTATGTCATCWlTKTAA 

C TAtM«JUUW6AWlfCJ^WATATATTGtTTITTCXTT»ttt»^ 
^ACAPXV^.EACTTTTT1TTTAC7TTTTACA~"GTCKACAAAATACPA 
CATA.V>.CGACATCG7TWATCATTTCCCAATTtTJW:cCC7AW3TTTfcACA 
rCTA0AACCTTCTCCATCTTCCC»AGCACASCCTDAri*CGAACA<3C'rrT 
ACCATTCTCATATTCCTSAACTACCTGAGTCCTCTCATTEATCTGTTTCC 
tCAAATCCGCTTCTGRCATCTTCTTCTCCAATCTCCCTTTCTCTATCATC 
AACCTCTCCTCTGCmCACACGATCCATCGCCGCAGGCTCTGTTTCTTC 
TTCCA<5CTTCTTDEtCTTA.kTCACCGCAACCGCCGTRGAVTTCCCCTTTT 
TGTTCGAACCCGCAtCGA\ATT7CT?AACCGT77GAACCGCGACACCGTTT 
CTCADA&CTGCGTr>ACCGCTTTCGGATCGCGTAGGTCTTGGCTCTTTTG 
TTTTGATTTC7<3GAGftACTACTGDTtCCCAG-TCTTGTGtTACTDCt«CTG 
GGTATCTGCTCGDCATCGTCGATGAATTGAGAGAAAGGWlCAACGCGAAA 
oTTTTft* n ATTGAGAAACGRTGAAG A.TGAAGAATG 



GGT»TCTGCTceec»i*»«'i'>' , J« « — ~* — ■- ■ 

ATTTTATTAATCTCAGTTTTC^AATTGAGAAACGRTGAAGWGAAGAATG 

TTGTTGWAGGATTGTCA^AT'rTATATATACGAAGATTGGWCTGGAGA 

R 7TCCATCATCrr:TTCTCCATTTTCCT^TCTGGAACGTrCTTAGAGATG 

ATTGACGACGTGTCATTATCTGATTTGCACl'TAACCAATGirrrrr'JGSGT 

TGGA-rrCGTGGTAfACCATArWTCC^ATTTGGCTCAATGGTTTTATATA 

W TTGGTT--7CGGTTCGGTTATGAr.T7A1CATTAAAATTAAGCTAACCA 

^AAATTTTCGTAAA?.TTTATTTCGG-7TrCAATTCGGATC30-JTACTTCCA 

0^C0AAT1A7TCG.^ACCGGGG7TA"CCGAACCGAATACCAATGCCTG 

AtTGACTCGTTGGCTAGAAAGA-fCCAACGCTATACftATAATAGAACATAA 

ATCGGACGGTCATCAAAGCCTCAAAG.W-GAACAGTCAACAA.^AGTT 

r^GCCCTGMGAGTATCGTTTCCGCCATTTCTACGACGCAAGGCGAAAAT 

aTTTGGCGCCAATCTTTCCCCCCTTTCGAATTCTCTCAGCTCAA.-mCATC 

G TTTCTCTCTCACTC:C7CtCACAATTCCAAAAAATGCA!5CDCCAGAGAT 

CGATTT7CTCT7-rCTTCCAAAA.^CCCACGGCGCCGACT?.CGAAGGGTTTr> 

GTTTCCGGCGATCGTGClAGCGOCGGGCCrGGCAGCGGAGACCACGATTT 

AATGTGAAGCAA4GGGATGCTAAAGGCGACGCTTCTGTACGTTTTCCTG7 

TTCGAAATCTGTCGA7GA<W7TAGAGGAACGGATACtCCACCG(»GAAGG 

7TCCGCGTCGTGTCCTGGCGTCTGGArtTAAGCCGGCTGAATCCGCCGG-I 

GATGCTTCGTCCCTGTTCTCCftATATTATGCATAAGrrrGTAXAAGTCCA 

TGAlOSAGATTGrrCTGGAGAGAGGTACTAATCTTCGATTCTCTtAArrt 

tgttatctttagctggaagaagaagattcetgtaatttgttgtattcgtt 

GGAGAGATTCTaAT7J^TGCAT?G<»TCGTTG7TTACAAATTrKAGGAC 

CCGAGAAGATGTrGTTCCGCTGAATGATTCATCTCTATGTArGAAGGCTA 

AK»TGTTATTCCTCAA7TTCGTTCCAATAA7GGT>^CTt^GAAAGA 

AACCATGCTTTTAGTTTCAGrGGGAGAGCTGAACTTAGATCAGTAGAAGA 

TATADGAGTAGATGGCGA7GTTCCTGGTCCA&AAACACCAGGGATGCGTC 

CACGTGCTTCTC^TGAAGCGAGTTCTGGAGGATCAAATGACTTTTAAG 

GAGI^TAAGGTTCCTCTA^GGACTCTAACAAAAGGWftAAATGCTCCJt 

C<5ATCCGGTTTGl<;GAGAGAW3AAAGAAGT^C«AW3C^CCAAATn& 

AATGGCTTGAGTCTTCTO^TCAGGGATGCCAATAGAAGACGTCCTGAT 

GAT^COCTTTACGATAGAAAGACC7^ACACATACCACCTBATG7 j 7TTCAA 

Figure 11 
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{3AA^TG7C7GGATCACAAAAGCAATATTGGRG7G7TAAGAGTGAATATA 
tGGACATWaCTTTTCTTTAMGTGGTTAfitAAC^ATTJWTCTACTGTT 
CAA7CCAT77CC7CAA7G7GATT7D77CAC77ACA7CTG777ACG77ATG 
CTCTTC7CAG6GGAAA7TTTA7GAGC7G7ATGAGCTAGA7GCCGAATTAG 
GTCACAAGGAGCTTGACTCGAAGA7<ViCCATGAGTCGTG7GGGAAAATGC 

agacaggtaaattac^tgaaacaactggcctgcttgaatta^tgtgtcta 

TAAATTrKACACCACCTTTtGTTTCAGGTTCGTATCTCTGAAAGTGGGA 

tagatcaggcagtgcaaaagctattagctcctgggtaagkaaccatcat 

ACTTTATGGAATTCGTTTACTGCTACTTCGGCTA&GAT7TAASAAATGGA 

aatcacttctjwcatcattagttag^tcctgagaactcaggatgttttc 

TTA7TCGTTATATAA7AAGTCTTTTCATCAA0GAGTAACAAAICAAAACTT 

GCACAATAT7TGTGTGCTCACTGGCAACGCATATATACCCAGCTAACCTT 

TGCTAGTTCAC7S7AGTAACAG7TACGGA7AA7A7ATG7T7AC77STATG 

TGG7ACCCTCATTTTC7C7CTCATGGAGGCTTTGAA6CCTTGTGT7GAAA 

C7GGATAGTTACATATGCTTCCAACAGAAACTAGCA7GCAGAT7CATA7G 

CTTT^CT A7TCT ACTAA7T A7G7 A TTC AC ACAC7CG?TCTTT2 TTTTCAA 

AGATATAAAG77GGACCAA7CCAGCAGCTA^AAACA7C7BACCAACCAAA 

AGCCAOAnD7GC-^.7ACLCTAAGTrTTCTT3GA7AG<i7CAAGGAGAGTC 

T7GCAGACTCT7TT7CA7C?l7T-rCT7T77CTCi7ACA77AC777CArGC7G 

TAA77AACTCAA7GCC7A71 C7CG7 C7 Z A77A7CAG A7 AA77CCAAGGAA 
GCTAGT7CAGG7AT7AAC7CCA7^AACAGCAADCGACCCAAACA7wGCGC 
CTGATGCCG7CCATCT7CTTGCTA7AAAAGAGG777GT7AT7TAC77ATT 
TATCT7ATCA7GTTCAG77CATCCAAGTCC7GAAAAAriACAC7DT7CT7 
XAGCAATCTTCCA7CAAG£7GTGTAAAGGAT77GWAT7AGAAAA7CATT 
ATTTGATCC777G7T7TA7ATCr7AGAGStTCCC77GAAAAGA7CTG7TT 
AAGATTC7TTGttCr7GAAAAAT7CAATCTTri7AA^7CAA7CCCC7AC7 
TTCT7ACAA7GATCATAG7C7GCAA7TGCA7GTCAAG7AftTATCA77CCT 
7G77AC7GCA7CCCCC7C7nC77AA7GACCA77G7CTA7GT7G7ST7TC 
TC7CG7G7GCTGGAGAAAATGA7AGC7GATCDAACCTC7ACATTA1 , CA7G 
A77AAGTAGCTGC7CAGGAA7rGCC777GG77ACA77GCC7AATGG777D 
ATG7CAA7T7VTCTTC7GAAKTTTA77TTAGATCAAARTGGAGC7ACAA 
AAGTGTTCAAC7GTG7A7GGAT77GCTTTTGTT&AC7G7CCTDCC77GAG 
G7-tTTGGG77GGCTCCATCA5CGA7GA7GCA7CA7GTCC7GC7CrrGGAG 
CG7TA7TGATCCAGG7AAGCAA^TGTAT7CTG7A7C77A7GTG7ACCATG 
TGAC7TCCTG7GCA7A7A77tGGG77GCAGGAACTAA7TCTGAATCACCA 
7T7GGTA767777TTCCAGMT7C7CCAAAGGAAG7G7TATATGACAGTA 
AAGG7AAACTGCTT7GTA7CGCCTG7TGTTTTGTIAAACAGAATT7AAGG7 
AAA7GACACTGGTTAATTTAAAGTGCATACATGTTGAAATATTGCAGGGC 
TA7CAAGAGAA6CACAAAAGGCTC7AAGGAAA7A7AC&7TGACAGGTACC 
A777CAG7AGGCAAGCTAACTGACAATTTAACCGCtCACCGAA7GATAGG 
TCTC7TAAACATTGC7AAT€7AGATGATG777A7G7T7CAA7CTAATAGG 
GTCTACGGCCGTACAGT7GGCTCCAG7ACCACAAGTAA7GGGGGA7ACAG 
ATGC7GCTGGAGTTAGAAATATAATAGAA7C7AACGGATAC777AAAMT 
7CT7C7GAA7CATGCAAC7G7^CTG77GATGG7C7AAA7GAATG7GATG7 
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tgccctt^gtgc^cttgcagabctaattaatcatctgtctaccctaaac^ 

tGTCTTG<^TGTTTAGTWTTGCrrn h CAC 
TTCRtAACTTACAGTTTCTATCT^ 

A7GGGGATATTTTTCCATACC7VAGTTTACACC(?CTTGtC7CAGAATT^AT 
CGCC^CGATGGTIU*TCT1G^ 

TCCTTCAGGCAAGTGCATATTTC7tTTT7GATPJlCrTCAACTAGAGG^^ 
GAC^TAGAAGGAAAAAtTCTAATACTTCGTACGGATCTCGAGTAAGTAAT 
AGCC»TTTTTGrrTACCTATCTAGGGACCTT07ACAAATATCVTGATAA 
CTCTGTTAGTCCAACTGGTAAGCGACTCTTAAGGAATTGCATCTGCCATC 

C ACTCAAAG atgtagaaagca t c aat aaacggc t tgatct agttg aag aa 

TTCi^CGGCAPACTCAGAAACTATGC^^TCACTGGCCAGTATCTCCACAA 
ACTTCCAGAGTlAGAAAGACTBCTCGGACGCA'tCAWJTCTAGCGTTCGAT 

catcagcctctgtgttgcctgctcttctggggaaaaaagtcctgaaacaa 

CCACTAAGTATCAATCACAAGTTTTCTGAGTAATGOCT'TCCA^AGTAOT 
ATAGGACTAAAACATTACCGGTCTACC7AAAGACTCTTC7CCTTCTTTTG 
CftATG7CTGGTTATTCATTACATTTCTCTTAACTTATTGCATTGCAGG7T 
AAAGCATTTGGGCAAATTGTGAAAGGGTTCAGAACTGGAATTGATC7GTT 
C?TGGCTCTACACAAGGAA7CAAATATGATGAGTTTGCT77ATAAACTCT 
&TAAA^TTCC^TA7TAr.TAGCk^.AAAGCC«l>C7AGAG7 TATT7 CTTTCT 
CAATTCGAAOCAGCCA7AGATAGCGAC7TTCC.«^AATrATCRGGTCCCCA7 
C7ATCT7TCAT ACT77AC AACAAAATGTC7GTC ACTAC7 CAAAGCAA7 GC 
ATATGGCTTA^7CTCAACTCACACCCCGAGGATCCT.WGGGATTTGCT 
T7TTATTCCTAATG?TTTTI^TGGTTTGATTtATTTr:^AC7TGAACT7 

attaatcttgtaccagaaccaa&atgtgac^agatbaaaacgctgaaactc 

TCACAATACTTATCG>AC77TTTATCGA.^GAGCAACTC.=»A7CGTCTGAG 
G7CA7TCACACCATAJ^CTGCCTAGATGTOCWA^TC77TTGCAA7CCC 
ftGCAAGTCTCTCTGCTGCSAACCATDECCAGGCCTGTTATTTTTCCCGAAT 
CAGAAGC7ACAGA?CAGAA7^AGAAAACAAAAGG<jCCAA7ACT7AAAATC 
.ZAAGGACTATC0CATCCATTTGCAG7TBCAGCCGATGGTCAATTGCCTGT 
?CCGAAT^TATACtCCTTGGCGAGGCTAGAAGAAGCAGTGGCA<;CA7TC 
A7CCTCGGTCA7TGT7ACTGACGGGACCAAACATGGGCGGAAAATCAACT 
CTTCTTCGTGCAA.CATGTCTGGCCGTTATCTTTGCCCAAGTrX^3TATACT 

CGT^T AGATAAT T ACT CT ATTCT? TGC AATCA &7TC T TCAAC AT GAATAAT 
AAATTCTGT5WCTGTC7GCJIGCTT6GCTGCTACGTGCCCTBTGAGTCTT 
GCGAAAtCTCCCTCGTGGAtACTATCTTCACAWGCTTCGCGCATC7GAT 

A^TCATGACAG^&AGAGTAAGTtra^ 

GAACTATT7ACTCAGATWGTCTGATTGGACAAG^TGCTTTtCC77TTT 
7TTAGGTACCT7TT7GGTAGAATGCACTGAGACAGCGTCAGTTCTTCAGA 

AT G CAAC T CAGGA TTCAC TAGTAA TC CT TG AC G AAC 7t G G C ACAG GAAC7 
AGTAC7TTCGATGGATACGCCATTGCKTAC7CGG f TAACC7GCTCTTCTCC 
TTCAACTTATACTtGTTGATCPAGAAAAACATGCAATTCATTTTGCTGAA 
ACTTATT^TTTATATCAGGTTTTTCGTCACCTG^TAGAGAAAGTTCAAt 
GTOGGATGCTC7TTGCAACACAT7ACCACCCTCTCACCAAGSAATTCGCG 
TCTCACCCACGTGTCACCTCGAAAmCATGGCTTGCGCATTCAAATCAA& 
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ATCTCS ATTATC AACCACGTGG7TCTG ATC AAG ACCT ACTGTTCTTGTMC 6*50 

GTTT AACCGAGSG AGCTTG7 CCTGAGAGCT ACGG ACTTCAAGTGGCACTC 5700 

ATGGCTGGAATACCAAACCAAGTGGTTGIAAACAGCATCAQGTGCTCCTCA ^ SO 

AGCCATGAAG AG ATCAATTGGGG AAAACT TC AAGTC AA<3?GA$CTAASAT 65 0 0 

CT€AGTTCTCAAGTCTGCATGAA<3ACTGOCTCAAGTCATTSSTGGGTATT 6B50 

TCTCGAGTCGCCCACAACAATCCCCC^ATTGGCGAAGATGACTACGACAC 6 9 00 

TTTGTTTTGCTTATGGC ATGA^TC AAATCCTCTTACTGTGTTCCCAAAT 535 0 

AAATGGCTATGACATAACACTATCTGAAGCTCGTTAAGTCTTTTGCTTCT 7 00 0 

CTG ATGTTTATTCCTC^ TAAAAAATGCTT AT AT ATC AAAAAATTCTTTCC 7050 

TCGATTAT AACAAGATTATATATGT ATCTGTCGGTTTAGCTATGGTATAT ? 1 DO 

AATATATGTATGTTCATGAGATTGGTCAAGAGAAATACTCACAAACAGTA 7 1 S E> 

T ATTAAGAAGG AAATATGTTTATGCATTAATTTAAG TT7C AAG ATAAACT 7200 

GCAAAT AACCTCG ACT AAAGTTGCAAAGACCAAACACAAATTACAAAAC^ 7250 

TATAAGACTTAAGTrCTGAATTCCCTAAAACC^^ 7 300 

TATTTTCTTGCATCTACAAACAACACAAACCTACATAGTTTATAACTTAC ? 330 

TCATCACTGAGATTAACATCA5SAATCATTCTCCATTTCTTCATCTTCACT 740O 

CTCATCATC ATC ACCAOCACCATGATG ATTCtCCTCCTCTTCACGTAACC 7*50 

T ACCAATCTC ACTCTCAGCTCTATCAACAATCTGCTTCTTCTGCAACTCX: 7 5 D 0 

AAATCTCrrCTGAAAATCAGCTCTCATCTTCTCCAACTCCTTCATTTGCTC 1550 

TTTCTTACTCTTCTCCATCTTC7 CATAAACCTTCCCAAACCTCTCAACAG 7 $00 

AATCCGCCAACAT CTTAT AGGAAGC ASC GTCATTAACCTTCTTCCTCTCG ? 650 

TACTCAACCTC AT C ATCCTCATCCTCCTCCT CTTCAGAATC ACCAGGACT "? 7 0 D 

ATCCA7CATCTCATC AAACCCFtfT AGACTT ATCTAAATAAACCTT AGTGT 77 50 

TC ATAAACAC AAACTC ACCTGAATCAACACCACAAGCTAAACCTAAATCC 1 & 0 0 

GACTTtiGGOGAAACACAAAGCAACATATCCAACT , TATTG, B LAAAACGACCA ? B 5 0 

TTTACTTGAACCTAAACCTG/tTTTCTCAACCTTAATCTTCTCTTTTCTAT 7 300 

ACTTCCTCTTCAAGTCATCAATCATT CTCCTACATT5CGTCTCAGATTTC 7 $50 

TCC ATCCTT ACCT CCTC AC?CACTTTCTC AGCTACTTCATPCCAATCCTC 8 00 D 

GTTCCTCAAACTCCTTC^ACCCAATTGCAAAAACCTATCTCCCCAAACTT 3050 
CAASCAACACAA 

Figur* 11 (Continued) 
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Figure 12 



Com™ente/ReMr«no«8; AtM3H3 3' »ide arrtkenae : A1MSH3 V (13 - ziWbp) 
pUCtW13 8al1fM1/T4 Into pCWl04 BamHt/T4 In Agrcbaclsrlum LBA4404 
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Figure 13 



Cos»m«iits/Refer*ncaa: AtMSH6 (SS) 3' slefe antis«ns : 62 SalUSsl1/T4 (1379bp) 
into pCW164 BamHl/T4 
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Figure 14 



Commt»n1smefor»TicBs: AlWSHS d'/AtMSHa 3* antf6«n6B : AIMSK6 (SB) 3' side (62=1379bpl 
Sal1/S6M/T4 Inio PPF13 (pCWW AtWSHS [SB) 9 oid* Ct3-2104> andMfiopSmaU in 
LBA44D4 
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S210 EccRl.Sael. Kpnt 



Figure 15 



Comments/References: AtMSHS 37AtMBK* 3* aniens (0) : A1MSH3 13S) 3' side 
(T9«2l04bp) Sall/SstVT4 into pPFl4 (AtM&Kd (S5) S'elde (*2»iS79bp> BTrtlsfrnfie Into 
pCWtB~4)/Srtiftt. <n LBA440* 
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8210 EcoR 1. Sad. Kpnl 



Figure 16 



Comma ntam^f ere iipbsj AtMSHi (SB> compl»t«. wiw DriBrtatten : pPFSB (M4Sbp) 
Braat Into pCWl64 Sm»i 
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Figure 17 
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Figure 18 
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Figure 19 
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SEQUENCE LISTING 



1 



PCTVEP98/M977 



.:lio> Rhone-Povlenc Agro; Bet2ner r Andreas Stefan; Doutriaux, 
Marie- Pascals,* FreyssinEC, Georges; Perez, Pascual . 

<.120> Methods for obtaining plant vaz-ieti&s 

<-.lSa> 39S49SC 

<150> P0974& 

<:151> 1997-10-10 

<1*0> 93 

<210> i 

*211> 23 

<212> DRA 

<213> Artificial ft^quftnee 
<220> 

*221 > modi f > e.d_ haflft 

c222:- 11 

•x223> I 

<220> 

<-22l> rnodif ietf^base 

-*252> 14 

*223> 1 

<22Q> 

c221> mod if ied_base 

<222> 17 

<222> I 

<220> 

<222> Degenerate oligonucleotides UPMU us©d to isolate AthtSM3 and 

*301> Ra^nan and Kolodner 

<3 02 > Genet i ca 

O03> 132 

<306> 963-973 

O07> 1992 

<400> 1 

ctggatccac nggnccnaay atg 23 



2 

23 



d210> 
<211> 
*2l2> 
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^1 3> Artificial sequence 

<22D> 

<2^l> modif ied_base 

<222> 15 

<:22i> I 

<220> 

<22i> modif ied_base 

%222> IB 

<223> I 

<220> 

^222 > Degenerate ol igonvcleotideB EOMU used bo isolate AtMSH3 and 
AtMSite . 

<30D> 

<301> Reenan and Kolodner 

<302.-» Generics 

O03> 132 

<306> 3*3-973 

<:307> 1392 

rtggatccrt artgngcnrc raa 23 



<2i0> 3 

<211> 24 

<212> DMA 

<213> Artificial sa<guence 
<22D> 

<223> msK3 specific primer 6 36 for PCR using cDNA of Ar&bidopeis 
t ha liana ecotype Columbia 

<400> 3 

tgctagtgcc tcttgcaagc teat 24 



<210> 4 
c2Il> 27 
<2\2> DNA 

c2l5> Artificial sequence 

<220> 

<223> Priioer aPi for pcr uaing c£>HA of Arabidopsis t ha liana ecotype 

Columbia containing Adapter sequence© ligated. to both its 
enda 

<400> 4 
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PCT/EP9&'06977 



27 



<:21D> S 
<2il> 23 
<2l2> DWA 

<2IJ> Artificial sequence 

<220> 

<223> Primer AP2 for ?Cfc using cDWA of Arabidopsis thaliana ecotype 

Columbia containing adapter sequences lighted to both its 
ends 

«400> 5 

actcactata gggcfccgagc ggc 23 



<21l> 30 
*212> OW* 

<2\2> Artificial sequence 

<220> 

<223> HSH3 specil'ic primer S525 Eor PCK using cDMA of ArabidopBis 

thaliana ecotype Columbia 

<400> 6 

aggttctgAt cacgcgtgac gctt^actta * B 



^210> 7 

<2ll> 2S> 
<212> 

<213> Artiticial sequence 

<220> 

<223> MBH3 epecific primer S5l for PCR using CDMA of Arabidopaia 
LhalianA ecotype Columbia 

<*on> l 



ggatcgggta ctgggfctttg agtgtgagg 



<21D> 8 

<211> 24 

<212> SNA 

<5l3> Artificial sequence 
<220> 

<223> spaciiiC primer 63 5 for PCR using cDtfA of ArsbidopaiB 

t ha liana ecotype Columbia 
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3 


gcacgtgct t 


gatggtcfttc tcac 


<210> 
<211> 
<2l2> 
*213> 


9 

28 
DMA 

Artificial sequence 


<220> 

<223> 


MSH3 specific primer S523 
thai i ana aeotypa Columbia 


<400> 


9 


tcagacagta 


tccagcatgg cagaagta 


<2H?> 
<211> 
<.2l2> 
<213> 


ID 
33 
ONA 

Art if ici<i 1 sequence 


<220> 
<225> 


KSK3 specific primer 1S5 
thai! ana e cotype C'olumbi a 


<400> 


10 


atcccgggat 


gggcsagcaa aagcagcaga cga 


■c 2 lo> 
<21l-> 
<21Z> 
<21>> 


11 
27 
£>NA 

Artificial Bequeace 


<220> 
<223> 


HSH3 specific primer S53 
thaliana ecotype Columbia 


c40G=> 


IX 


gacaadg&^c 


gaaatgaggc cccttgg 



24 



2ft 



33 



27 



<210> 12 

<211> 1250 

<212> CfHA 

<213> Arabidopcis thai i Ana ecotyp* Coluiribia 

<223> Clon« 52 
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*4D0> 12 

cccgggacgg gcaagcaaaa gcagcragacg acttctcgtt tcttcgctcc caaacccaaa $0 
tccccgactc acgaaccgaa tccggtagcc gaateeccaa caccgccacc gaagatatcc ISO 
gccactgtat ccttctctcc ttccaagcgt aagcttcLct ccgaccacct egccgccgcg 1B0 
tcacceaaaa agcctaaact Ltctcctcac actcaaaacc cagtacccga ccccaattta 24 0 

caccaaagat ctctceagag atttctggsa ccctcgcegg aggaacatgt teccgaaacg 3 00 

ccaccatcga ggaaatacac accattggaa cagcaagtgg tgg&gcta&a gagcaagtac 3 so 
eeagacgtgg t tt cgatggt ggaagttggt tacaggtaca gat 1: etc egg agaagacgcg 420 
gagatcgcag cacgcgtgtt gggratttac gctcatatgg atcacaattt cacgecggcg 4fi0 
A^tgtgcc^ia cntttcgatt gaatxt; treat gtgagaagac tggtgaafcgc ag^atacaag 54 0 

atcggtgeag c.^aagcagsc tgsaaccgca gccattaa^t cccacggcgc aaaccggacc £0D 
ggcccttttt iccggggact gteyyo^L l 3 tataccaaag ccacgcttga agcggctgag 660 
gatataagrg gr.gg^cgtgg tggtgaagaa ggtcLtggtt cacagagtaa ttfccctggtt 720 
egtgcegtgg atgagagagr taagtcgg&g acattaggct gtggtattga aatgagtttt 730 
gatgtcagag tcggtgttgt tggcgt tgaa stLtcgacaq gtgasgcr.gr. tcataaagag 64 0 

ttcaatgata acrceatgag asgcggatta gaggctgtga tttcgsgett. gtcaccagct 9 00 

gagctgtcgc ctggccagcc tctfct^acaa caaaccgag* agttxttggt ggcacatgct 9&0 

ggacctacct caaacgttcg agtggaacgt gcctcactgg aer.o;cttcag caatggtaat 1O20 

gcagtagatg a$gttatCcc attstgtgsa aaaatragcg caggtaactt agaagatgat 1080 

aaagaaatga agrtggagge tgctgaaaaa ggsatgtett gcttgacagt tcatacaatt 114D 

at-gaacatgc cacatctqac tgttcaagcc ctcgccctaa cgttttgcca tctcaaacag 12 00 

bttggatttg aaaggaccct ttaccaaggg gcctrscttc gctctttgtc 12 50 



c210> 13 
<21I> 34 

<213> Artificial aeguOACd 

«22Q> 

<233* >1SK3 specific primer 2SS for PCR using cDWA of Arabidop^is 

thai i ana ecotypa Columbia 



nISDOCIO <WO 9919492A2TI > 



WO 99/1 WS2 PCT/EP9S/D6977 

6 

c40O> 13 

atcc^ggycc aaaacgaaca agtcggcxct agtc 3 4 

*2il> 27 
<212> ONA 

<2ll> Artificial sequence 

<22Q> 

<223> specific primer S52 for PCR using CDNA of Arabidopaie 

c ha Liana ecotype Columbia 

<4D0> 14 

gccacatctg actgttcaag ccctcgc 27 



<:2io-> is 

<211> 2110 

•c2l3> Ar-<»>>i.rti>j>sis tr.nlz una ecotypc Celursfcaa 

*223> Clone 13 



<-900> 


ir> 












gcc&catctg 


actgt tcaag 


ccctcgccct 


aacgtt ttgc 


catctcaaacr 


agtttggaer 


*0 


tgnaaggatc 


ctctsccaag 


gggcc^catt 


Ccgctctttg 


tcaagtaaca 


cagagatgae 


120 


ccc ctcagcc 


aat&etccgc 


aacagtrt^ga 


ggttgtg&aa 


aacaantcag 


atggatcgga 


160 


atetggczcc 


ctsctccata 


Atatgaatca 


cacacttaca 


gratatggtt 


ccaggctccc 


240 


tagatcoctgg 


gcgacccatc 


ctctacgcga 


sagaaac ttg 


atatctgctc ggcttgafcgc 


300 


tgtttctgag 


atttctgcet 


gcatgggatc 


tcatagttct 


t cccagccca 


gcagtgagtt 


3*0 


ggttgaagaa 


ggttctg&ga 


gagcaabtgt 


atcacctgag 


ttttatctcg 


tgctctcctc 


420 


agtettgaca 


gccatgtcta 


gatcatctrga 


t&ttcaacgt 


ggaataacaa 


gaatccttca 


490 


ccggactgct 


aaagecacag 


agttcattgc 


agttatggaa 


gctattttac 


tsgcggggaa 


54 0 


gcaaattrag 


rggcttggca 


taaagcasga 


ctctgaaatg 


aggageatigc 


aatctgcaac 


500 


tgtgcgatct 


actcte ttge 


gaaaattgat. 


ttctgttatt 


tcatcccetg 


t tgtggttga 


€60 


caatgccgga 


aaacttctct 


ctgccctaaa 


taaggaagcg 


gctgttcgag gtgacttgct 


120 


cgacatacta 


atcacttcca 


gcgaccaatt 


tcetgagct c 


gctgaagctc 


gccaagcagt 


7*0 


tttagtcatc 


aggg2aaagc 


tggattcctc 


gstagcctea 


tttcgcaaga 


agcccgctac 


840 
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tcgaaat t tg 


gaatt CCttC 


aagtgccggg 


gatcacacat 


tcgatagagc 


cgcccgttga 


500 


t cccaaggtc 


crtatgaac t 


gggtgaaagt 


aaatagcacc 


aagaagacta 


tficgatatca 




ccccccagaa 


acagcagctg 


gcttggatga 


getageceta 


gesactqaac 


ac cLtgccac 


L020 


tgtgaaccgs 


get tcgtggg 


atagttt cct 


caagagtttc 


agcagatact 


acacs^attt 


10&0 


taaggctgcc 


gttcaagetc 


ttgccgcact 


ggactgtttg 


cactcccctt 


caactctatc 


1140 


tagaaacaag 


aactatgtcc 


gccccgagtt 


tgtggatgac 


tgtgaaccag 


ttgagataaa 


12 00 


catacsgtct 


ggtrgtcatc 


ctgtactgga 


gactatatta 


caagataact 


c<?gtcccaaa 


1260 


tgacaeaatt 


ttgcaegcag 


aaggggaata 


ttgecaaatt 


atcaccggac 


ctaacatggg 


132D 


aggaaagagc 


tgctacatcc 


gtcaagttgc 


tttaatttcc 


ataatggctc 


aggttggttc 


13*0 


ctfctgfcaeca 


gegteatteg 


ccaagccgca 


egtgcetgat 


ggtgttttca 


c t cgQatggg 


1440 


tgct tcsgac 


agt ate cage 




tacctttcta 


gaagaatCaa 


gtgaaccqtc 


X500 


acaca^asr.c 


a^aac^tgr.t. 


cttctcgctc 


qc^tgzeata 


ttagacgagc 


r.r-qqaagaga 




C3 c c a«2Ct* ca 


cacgacgc^tg 


v.agccaccgc 


etatgeaaca 


t tacagcatc 


tcctagcagi*. 


1620 


aaa.g-a.gatg t 


ttggttctt t 




t LacccCgaa 


atagctgsga 


Lcsgtaacgg 


1630 


attcccaggt 


cctgttggga 


cataccatgt 


cccgcacctg 


acaccgcaga 


aggataaagg 


1740 


cagteatgac 


catgatgatg 


cgacctacct 


atataagcex 


gtgcgtggfce 


ettgeageag 


1300 


gagct ttggt 


t ttaaggttg 


ctcagcttgc 


ccagacscct 


ccstcatgta 


tacgccgagc 


1B&0 


cacttcaatg 


getgeaaaar. 


tggaagctg* 


ggcacgcgca 


agagagagaa 


atacaegcat 




gggagaacca 


gaaggacatg 


aagaaccgag 


Aggcgcagaa 


gaatctactt 


eggctctagg 


19*0 


cgactcgtct; 


gcagacctga 


aatttgeect 


ctctgaagsg 


gacecttgga 


aagcactcga 


£040 


gtttttaaag 


catgcttgga 


agattgetgg 


caaaatcaga 


ctaaaaccaa 


cttgttcatt 


2100 


ttgacccggg 












211D 


<210> 
<211> 
<212> 


29 
DMA 













<213> Artificial sequence 

<22Q> 

<22l> M5H3 specific primer S$l fox PCn ueing c£»JA ot Arabidopsis 

thaliana ecotj'pe Columbia 

<400> is 
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3 

ggatcgggta ctgggttttg agtgtgagg 2 3 

c*ll> 30 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> MSH3 specific primer 352 5 for PGR using CDNA ot Arabidopaia 

thaliana ecotype Columbia 

<40O> n 

aggttctgat tatgtgtgac gcttcactta 3 0 



<2lD> IB 

c2Ll> 3522 

<2l2> DMA 

«c2I3> Ara±>±dop&±i; c/zctlinna ecotype Coiuifnbi.dk 
<220> 

<22i> CDS 

*222> (100) .... (3342) 

*22 3> AtMSH3 fcl 1 length cONA and deduced ssiguence of encoded 
polypeptide 

<400> IS 

ecsaagasag cgcgc<?aaaa ttggcaaccc asgtccgcca tagrcacgnc cacgacetcc so 

catttctcrt aaacggagga gattacgaat aasgcaact £9 

atg ggc aag caa dag cag cag acg ate tct cgt ttc ttc get ccc aaa 14 *v 

Wet Gly Lya Gin Lya Gin Gin Thr lie Ser Arg Pbo Phe Ala Pro Lys 
1 5 10 15 

ccc aaa tec ccg act cbc gaa ccg aat ccg gta gec gaa tea tea aca 195 
Pro Lys Ser Pro Thr Hta Glu Pro Asti Pro val Ala Glu ser ser Thr 
20 35 30 

ccg cca ccg aag ata tec gee act gta tec tte tct ect too aag cgt 2*3 
Pro Pro Pro Lys lie ser Ala Thr val ser Phe ser Pro ser Lya Axg 

35 40 45 

aag ett etc tec gac cac etc gee gee gcg tea ceo aaa aag cct aaa 291 
Lys Leu Leu Ser Aap Hie Leu Ala ALa Ala Ser Pro Lys Lys Pro Lya 
50 55 60 

ctt tct cut cac act caa aac cca gta cce gat ccc aat tta cac caa 33 9 

Leu Ser Pro Hia Thr Gin Asn Pro Val Pro Abp Pro Aan Leu Hi a Gin 
55 70 ?5 SO 
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aga ttt ctn 
Arg Phe Leu 



ga3 3 eg tea 
clu Thr set* 



gag eta aag 
Glu Leu Lys 
115 

tac agg tac 
Tyr Arg Tyr 

ttg ggt att 
Leu Gly lie 

cca aca ttt 
Pro Thr Phu 



tac a a ci ate 
Tyr Lys lie 



cac ggt gca 
Bis Gly Ala 
195 

tat acc aaa 
Tyr Thr Lys 
210 

ogt ggt gaa 
■21 y Gly Glu 
225 

gtg gat gag 
Val Aap Clu 



agt ttt sat 
Sir Pha Asp 



gaa gtt gtc 
Glu Val Val 
27S 



cag aga ttt 
Gin Arg Phe 
35 

cca teg agg 
Scr Sar Arg 
100 

age aag tac 
Ser Lys Tyr 



aga ttc ttc 
Arg Phe t>be 



tac get eat 
Tyr Ala Kt3 
150 

cga ttg aac 
Arg Ldu Asn 

ggt gca gtg 
Gly V»l Val 

aac egg ace 
Asn Arg Thr 



gee acg ctt 
Ala Thr Leu 



gaa ggt ttt 
Glu Gly The 
230 

aga gtt aag 

Arg val hys 
245 

gtt aga ejee* 
Val Arg Val 
260 

tat gaa gag 
Tyr Glu Glu 



ctg gaa ccc 
Leu Glu Pro 



aaa tac aca 
Lys Tyr Thr 
105 

# 

cca gat gtg* 
Pro Asp Val 
120 

gga gaa gac 
Gly Glu Asp 
135 

atg gat cac 
Met Asp His 



ttc c&Z gtg 
Phe His Val 



aag cay act 
Lys Gin Thr 
IBS 

ggc ccx ttt 
Cly Pro Phe 

2D D 

gaa ^gcg get 
Glu Ala Ala 
215 

ggt tea cag 
Gly SnT Gin 



teg gag aca 
s«r Glu Thr 



ggt gtt gtt 
Gly val val 
265 

etc aab gat 
Phe Ab n Asp 
280 



CCg ceg gag 
Ser Pro Glu 

Cca ttg gaa 
Pro Leu Glu 



gtt ttg atg 
Val Leu Met 



gca gag ate 
Ala Glu He 
140 

aac etc atg 
Asn Phe Met 

aga aga et g 
Arg Arq Ley 
1.7C 

Ssa act gca 
Clu Thr Ala 



ttc egg gga 
Phe Arg Gly 



gag gat a~a 
Glu Asp lis. 

220 

age aat ttc 
Ser Aen Phe 
235 

tta ggc tgt 
L^u Gly Cys 
Z3Q 

ggc gtt gaa 
Gly Val Glu 



aat ttc atg 
Asn Pha Wet 



gaa tat gtt 
Glu Tyr val 
95 

eag csa gtg 
Gin Gin Val 
110 

gtg gaa gtt 
Val Glu val 
125 

gca gca cgc 
Ala Ala Arg 



acg g eg agt 
Thr Ala Ser 



gtg aat gca 
val Asn Ala 

17^ 

gee att aag 
Ala lie Lys 
130 

ctg teg gcg 
Jjfeu Ser Ala 

205 

ggt ggt 

Ser Gly Gly 



ttg gtt tgt 
Leu Val Cya 



ggt abb gaa 
Gly He Glu 
255 

att teg aca 
He Ser Thr 
2^0 

aga agt gga 
Arg S*r Gly 
2&5 



ccc 38 7 

Pro 



gtg 435 
Val 



ggt 433 
Gly 



gtg 531 
Val 



gtg 57 9 

val 

ISO 

gga $21 
Gly 



tec b*?5 
Ser 



ttg 723 
Leu 



tgt 7-71 

cys 



gtt 319 

Val 

240 

atg 357 
Met 



ggt 915 
Gly 



tta 963 
Leu 
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gag get gtg att ttg age ttg tea cca get gag ctg ttg ctt ggc cag 1011 
Slu Ala Val lie Leu Set Leu Ser Pro Ala Glu Leu Leu Leu <Sly Gin 
250 295 JOO 



cct. cct tea caa ca<i a or. gsg aag ttt ttg gtg gca cat get gga cct 105 9 

Pro L&u Ser Gin Qln Thr Glu Lys l>he Leu Val Ala Met Ala Gly Pro 
305 310 315 320 

acc tea aac gtt cga gtg gas cgt gec cca ctg gat tgt ttc* age aat 1107 
Thr Ser Asn Val Arg Val Glu Arg Ala Sor Leu Asp Cys £he Ser Aan 
325 330 335 

ggt aat gca gta gat gag gtt att tea tta tgt gaa aaa ACC age gca 115 5 

Gly Asn Ala Val Asp Glu Val lie Ser Leu Cya Glu Lys He Ser Ala 
340 345 350 



ggt aac tta gaa gac gat aaa gaa atg aag etg gag get get gaa aaa 12 03 

Gly Asn Leu Glu Aap Asp Lys Glu Met Lys Lou Clu Ala Ala Glu l.y» 

335 360 }6S 

gga atg tct tgc ttg aca gtt cat aaa att atg aac atg cca cat ctg 12S1 

Gly >iec Ser cys Leu Thr Val Hiu Tbr lie Wet Asr) Met Pro Hie Leu 
370 375 3B0 



act gtt caa gec etc gee ct* a^g ttt tgc cat c:c aaa cag ttt gga 12 9 9 

Thr val Gin Ala Leu Ala Lfcu Thr Pht> Cys Bis Leu Lyc Gin Phe Gly 
3S5 330 395 400 

ttt gaa agg ate ctt tac c*a ggg gec tea ttt- cgc r.rc ttg tea agt 134 n 

Phe Glu Arg lie Leu Tyr Gin Gly Ale* Scr Ph<z Arg Ser L«?u Ser Ser 
405 410 415 



aac aca gag atg act etc tea gec aat act ctt* ca<a cag ttg gag gtt 13$ 5 

Asn Thr Glu Met Thr Leu Ser Ala Asn Thr Leu Gin Gin Lou Glu Val 
420 425 430 

gtg aaa aat aat tea gat gga teg gaa tct ggc tec tta ttc cat aat 144 3 

Val Lys Asn Aan Ser Asp Gly Ser Glu ser Gly ser Leu Phe wis Asn 
435 440 445 



atg aat eac aca ctt aca gta tac get 
Met Asn Kis Thr Lau Thr val lyr Gly 
450 455 

gtg act cat cct eta tgc gat aga aat 
Val Thr HiP Pro Leu Cye Abp Arg Aan 
465 470 

get get tct gag ntt tct gec tgc atg 
Ala Val ser Glu He Ser Ala Cy* Met 
465 



tec agg ctt ctt aga cac tgg 14 93. 
Ser Arg Leu Leu Axg Hi a Trp 
460 

ttg ata tct get egg ctt gat 15J9 
Leu He Ser Ala Arg Leu Aap 
475 4B0 

gga tct cat agt tct tec cag 158 7 
Gly Ser Hi Ser Ser Ser Gin 
490 495 
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etc age sgc gag teg get gaa $aa ggs tct gag aga gca att gta tea 16 3 5 

Lf-u Ser Sftr Glu Leu vai olu Glu Gly Ser Glu Arg Ala He Val 3er 
SOD 505 SIP 



cct gag ttt tat CS-C gtg etc tec tea gtc teg sea get atg cct aga 1633 
Pro Glu Phe Tyr Leu Val Leu 3er Ser Val Leu Thr Ala Met Ser Arg 
SIS 520 B2S 



tea ret gat att csa cgt gga ata aca 
ser Sex Asp lie Gin Arg Gly He Thr 
530 535 

aaa gec gag etc att gca gtt atg 
LyB Ala Thr Glu *he He Ala Val W*t 
£45 550 

aag caa att cag egg cct ggc eta aag 
Lya Gin He Gin Arg i*eu Gly lie Lys 
565 ^ 

atg caa tec gca ace gtg cga tec Act 
Met CLn Ser Ala Thr val Azg Scr Thr 
580 5*5 



aga ate ctt cat egg act get 1731 

Arg He Phe His Arg Thr Ala 
54 0 

gaa get att tta ctt gcg ggg 177 9 

Glu Ala He Leu Leu Ala Gly 
555 S60 

caa gac tec gaa atg agg agt 1 827 

Gin Aap 3er Glu Met Arg Ser 
570 575 

ctt ttg aga aaa ttg att tct 1B?S 
Leu Leu Arg Lys Leu lie £er 
S90 



gtt a^T. tea tec ccz, gr.t gcg gtY- gac aat gec gga aaa ctt etc tct l$2S 

Val lie Scr Ser Fro Val Val Val Aap A&n Ala Gly Lya Leu Leu Ser 
595 600 

gec ct3 aat aag g<sa gcg get gtt ega. ggt gac ttg etc gac ata eta 1&71 

Ala L.<30 ASU Lys Glu Ala Ala Val Arg <Sly Asp Leu Leu Aep He Leu 
610 615 £20 

stc act tec age gac caa ttc coz gag ctt get gaa get cgc caa gca 2015 

I la Thr Scr Ser Asp Gin Phe Pro Glu Leu Ala Glu Ala Arg Gin Ala 

625 630 635 £40 



gtt Zta gtc ate agg gaa aag ctg gat tec teg ata get tea ttt cgc 2067 
Val Leu Val He Arg Glu Lya Leu Asp flar Ser lie Ala 3er Phe Arg 
645 650 $55 



aag aag etc get att cga aat ttg gaa ttt ctt caa gtg teg ggg ate 2115 
Ly& Lya Leu Ala He Arg Am Leu Glu Phft Leu Gin Val Ser Gly He 
€60 €65 «70 

aca cat ttg ata gag ctg ccc gtt gat tec aag gtc cct atg aat tgg 2iC3 
Thr His. Leu lie Glu Lbu Pro Val Aap Ser Lys Val Pre* His Aan Trp 
$75 6 80 £95 



gtg aaa gta aat age acc aag aag act act cga tat cat ccc cca gaa 22x1. 
Val Lys Val Asn Ser Thr Lya Lya Thr He Jirg Tyr His Pro Pro GU» 
690 5*5 700 
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ata gta get ggc ctg gat gsg c;&a get eta gca act gaa oat ctt gee 225? 

I Lb Val Ala Gly Lau Asp Glu Leu Ala Leu Ala Thr Glu Hia Leu Ala 
"?DS 710 7L5 720 

act gtg *ac ega get teg tgg gac agt ttc etc sag agt ttc agt aga 2307 
lie Val abji Arg Ala Ser Trp Asp $er Phe Leu Lys Ser Phe Ser Arg 
7 25 730 735 

tac tac aca gat ttt aag get gec get cad get ctt get gca ctg gac 235 5 
Tyr Tyr Thr Asp Fhft Cys Ala Ala Val Gin Ala Leu Ala Ala Leu Asp 
740 74S 750 

tgt ttg cac tec ctt tea act eta tct aga aac aag aac tat gtc cgt 
Cys leu Hia ser Leu Ser Thr Leu ser Arg Aen Lys Asn Tyr Val Arg 
155 750 765 

ccc gag ctt gtg gat gae tgt gaa cca gtt gag ata aac ata cag tct 2451 
Pro Glu Phe Val Asp Asp Cys Glu Pro Val Glu lie abji Xle Gin ser 
770 77S " 7BO 

ggc cgs cat cct gra cr.g gag aci aca tta caa gac aac ttc gte oca 24 9 9 

Gly Axg His Pro Val Lea Glu Thr- ile Leu Gin Asp Asn Pb* val Pro 
7SS "*S»0 795 900 

aat gac aca ace ttg rat gcra g** sgg 9** cat tgc caa att ate ace 254? 
Asn Asp Thr lie Leu Bis Ala Glu Gly Glu Tyr Cys Gin lie lie Thr 
605 810 Bl$ 

gga cct aac atg gga gga aag age cgc tac ate cgt caa gtt get tta 259 & 

Gly Pro Asn Met Gly Gly Ly& S«r Cys Tyr lie Arg Gin Val Ala Leu 
S20 830 

act ccc ata atg get cag gtt ggc ccc tzz gca cea gcg tea ttc gec 2643 
lie Ser He Met Ala Gin Val Gly Ser Phe Val Pro Ala Ser Phe Ala 
B35 B40 845 

aag ctg cac gtg ctt gat ggt gtt ttc act egg atg ggt get tea gac 2591 
Lya Leu His val Leu Asp Gly Val Phe Thr Arg Met Gly Ala Ser Asp 
B50 355 9 SO 

age ate cag cat ggc aga Agt acc ttt eta gaa gaa tta age gaa gcg 273 9 
Ser lie Gin His Gly Arg Ser Thr Phe Leu Glu Glu Leu Ser Glu Ala 
3*5 370 675 $g D 



tea cac ata ate aga ace cgc tct tct cgt teg ctt gtt ata tta gat 
Ser Hia lie lie Arg Thr Cye Ser Ser Arg Ser Leu Val lie Lem Asp 
B85 B90 

gag cct gga aga ggc act age aca cac gac ggt gta gee att gee 
GlU Leu Gly Arg Gly Thr Ser Thr His Aap Gly Val Ala He Ala 
900 ?P5 910 



2787 



tat 2B35 

Tyr 



5DOCIO: <WO 9919492A2TI > 



WO 99/1 9492 



PCT/RP9&70697'? 



13 

gca aca Lt3 cag cat etc eta gca gaa aag aga cgt ttg gtt ctt tct 26 B 3 

Ala Thr Leu Gla His Leu L u Ala Glu JLys Arg CyS L6U Val Leu Phe 
91S 320 525 



gtc acg cat tac cct gaa eta get gag ate agt aac gga tec cca ggt 2931 

Val Thr Kia Tyr Pro Glu Ll<s Ala Glu lie Ser Aan Gly Phe Pro Gly 

930 935 940 

tct gtt $gg aca tac cat gtc teg tat ctg aca ttg cag aag gat aaa 2 97 9 

Ser Val Gly Thr Tyr Kia Val Ser Tyx Leu Thr Leu Gin Lya Asp Lya 
$45 95S 960 

ggc age tat gat cat gat gat gtg aee tae eta tat aag ctt gtg cgt 3057 

Oly $er Tyr Aap His Asp Asp Val Thr Tyr Leu Tyr Lyfc Leu Val Arg 
965 970 ?>7S 



ggt Ctt tgc age agg age ttt ggt ttt aag gtt get cag ctt gec cag 3075 
Gly Leu Cys Ser Aarg Ser Pbe Gly Phe Lys VaL A La Gin Leu Ala Gin 
930 995 990 

aca cec cca tea tgt ata cgt cga gec act tea acq get gca aaa ttg 3123 
lie Pro Pro- Ser Cys Ha Arg Arg Ala Tie Ser Met Ala Ala Lys ueu 
99S 1000 1005 

gaa gee gag gta cgt gca aga gag aga aa: aca cgc atg gga gaa cca 3171 
Glo A1& Glu val Arg Ala Arg t;lu Avg Asn Thv Arg Met Gly Glu Pro 
1Q10 1015 1020 

gaa gga cat gaa ^aa ccg aga ggc gca gaa gaa tct att teg get eta 3 219 

Glu Gly Ki£ Glu Glu Pro Arg Gly Ala Glu Slu He Ser Mb Leu 

102S 1030 103S 1040 

ggt gac ttg ttt gca gac ctg aaa ttt get cr,c ten qaa gag gac cct 32S7 
Gly Asp Leu Phe Ala Asp Leu Lys Ph* Ala Lou Ser Glu Glu Asp Pro 

10S0 1055 



tgg aaa gca ttc gag ttt tta aag cafc get tgg aag atfc get ggc aaa 
Trp LyB Ala Phe Glu Phe Leu Lys Kie Ala Txp Lys He Ala Gly Lya 
1060 1065 10^0 

ate aga eta aaa cca act tgt teak ttt tgatttaatc tcaacattat 
lie Arg Leu Lye Pro Thr Cys Ser Phe 
1075 1DS0 

ageaactgea aggtcttgat catctgttag ttgegtacta scttatgtgt attagtataa 3422 

caagaaaaga gaattagaga gatggattct aatccggtgt tgcagtacat cttttctcca 34 B 2 

cccgcataaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3522 



2262 



<2l0> 19 
<211> 10B1 
<312> PJCT 
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<2i3* Ar&bldop&is thai iana ecotype Columbia 

<32y* Polypeptide MSN 3 



Met Gly Lys Gin 
1 

Pr Lys Ser Pro 
20 

* Pro Pro Pro bys 

35 

Lys Leu Leu Sot 
SO 

Lau Sdt Pro Kia 
€5 

Arg Phe Leu cm 



Glu Thr ser ser 

100 

Glu Leu LyB Ser 

Tyr Arg Tyr Arg 
130 



Lys Gin Gin Tnr 
Thr Hi a Glu Pro 



lie Ser Ala Thr 
40 

Asp Hi 5? Leu Ala 

Thr Gin A3i* Pro 
70 

Arg P)"w? Leu Glu 

Ser Arg Lys Tyr 



Lys Tyr Pro A&p 
120 

Pha PJV& Gly Glu 
13S 



lie Ser Arg £>he 
10 

Aba Pro val Ala 
25 

vsl ser phe ser 



Ala Ala ser Pro 
£0 

val pro Asp Pro 
75 

Pro Ser Pto Glu 
90 

Thr Pro Leu Glu 
IDS 

Val Val Leu Met 



Asp Ala Glu Tie 
140 



Phe Ala Pro Lys 

15 

Glu Ser Ser Thr 
JO 

Pro sex Lya Arg 
45 

Lys LyB fro Lye 



A$n Leu His Gin 
BO 

Glu Tyr Val Pro 

95 

Gin Gin Val Val 

LID 

V*l Glu VaI Gly 
125 

Ala ala Arg val 



L*u Gly lie Tyr Ala 
14S 

Pro Thr Phe Arg Leu 
165 

Tyr Lys He Gly Val 

KiB Gly Ala Asn Arg 
195 

Tyr Thr Lys Ala Thr 

210 

Gly Gly Glu Glu Gly 
225 

val abp Glu Arg val 

245 



His Nee ARp His Acn Phe 
ISO 155 

Asa Phe His val Arg Arg 
170 

Val Lys Gin Thr Glu Thr 
las 

Thr Gly Pro Phe Phe Arg 
200 

Leu Glu Ala Ala Glu Asp 
215 

Phe Gly Ser Gin 5*r Asn 
230 235 

Lye ser Glu Thr Leu Gly 
250 



Me*C Thr Ala ser Val 

160 

Leu val Asn Ala Gly 

175 

Ala Ala He l/ys ser 

190 

Gly Leu Ser Ala Leu 
205 

He Ser Gly Gly eye 
220 

Phe Leu Val Cyd Val 

240 

Cys Gly He Glu Ket 
255 
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Ser Phe Asp Vai. Arg Vul Gly Val Val Gly Val Glu Lie Ser Thr Gly 
260 265 270 

Glu VJtl Val Tyr ulu Glu Phe Asn Asp Asn Phe Hat Arg Ser Gly Leu 
275 230 225 

Clu Ala Vol lie Leu So* Leu Ser Pro Ala Gltf Leu Leu Leu Gly Gin 

230 295 SOD 

Pro Leu Ser Gin Gin Thr Glu Ly& Phe Leu val Ala Mat Ala Gly Pro 
305 310 315 320 

Thr Sex Aan Val Arg val Glu Arg Ala Ser Leu Asp Cya Phe Ser abji 
325 530 335 

Gly Asn Ala val Asp Glu Val lie Ser Leu Cys Glu Lys Tie Ser Ala 
340 3*5 3S0 

Gly Asn L*u Glu A*3> Asp Lya Glu Wet LyS Leu Glu Ala Ala Glu Lye 
355 3AO 3*5 

Giy Mftt Ser O/s Le;> Thr Val Kis Tb.r 1.1* Met Asn Mot Pro His Leu 
310 375 300 

Thr V3l Gin Ala. r..eu Ala Leu Tiir Phe Cys Hi a Leu Lys Gin Phe Giy 

IBS 350 395 300 

Phe Glu Arg lie Leu Tyr Gin Gly Ala &er Phe Arg Ser Leu Ser Ser 
405 410 415 

Asn Thr Glu Met Thr Leu ser Ala Asn Thr Leu J5ln Gin Leu Glu Val 
420 425 430 

VnL Lys Asm Aan Ser Asp Gly Ser Glu Ser Gly Ser Leu Phe Hie Asn 
43S 440 445 

>1et Asa His Thr Leu Tlir val Tyr Gly Ser Arg Leu Leu Arg Hie.Trp 
450 455 460 

Val Thr His Pro Leu Cya Asp Arg Abu Leu lie Sat Ala Arg Leu Asp 
465 470 475 4fi0 

Ala Val Ser Glu lie Ser Ala Cys Mat Gly Ser His Ser 6©r Ser Gin 
4*5 490 495 

Leu Ser Sfer Glu Leu Val Glu Glu Gly Ser Glu Arg Ala He Val Ser 
500 505 510 

Pro Glu Phe Tyr Leu val Leu ser Ser val leu Thr Ala Met Ser Arg 
515 520 525 

Ser Ser ABp He Gin Arg Gly Tie Thr Arg lie Phe His Arg Thr Ala 
530 535 540 
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Lys Ala Thr Glu Phe lie Ala val Met Glu Ada lie Leu Lsu Ala Gly 
545 550 555 560 

Lys Gin He Gin Arg Leu Gly lie Lya Gin Asp Ser Glu MeC Arg Ser 
56S 570 575 

l*et Gin Ser Ala Thr Val Arg Ser Thr Leu LftU Arg Lys Leu He Ser 
5^0 565 590 

Val He Sex Ser Pro val Val val Asp Asn Ala Gly Lys Leu Leu Ser 

595 600 605 

Ala Leu ASH Lys Glu Ala Ala Val Arg Gly Asp Leu Lbu Asp Ila Leu 
610 615 620 

lie Thr Ser Ser Asp Gin Phe Pro Glu Leti Al* Glu Ala Arg Gin Ala 
625 630 635 $40 

Val Leu vaI He Arg Glu Lys Leu Asp Ser Sex He Ala Ser Phe Arg 

645 650 $55 

Lys Lys Leu Ala lift Arg Assn L-su Clu Phe Leu Cln Val Ser Gly He 
660 66i» 670 

Thx His Leu rle Glu Leu Pro val A£*p Ser Lya Val Pro His Aon Trp 

6&0 sas 

Val Lys Val Asn Ser Thr Lys Lys Thr He Arg Tyr His Pro Pro Glu 
690 695 700 

He Val Ala Gly Leu Asp Glu Leu Ala Leu Ala Thr Glu Hi a Leu Ala 
705 7X0 71S ISO 

He val Asn Arg Aid Ser Trp Asp Ser Phe Leu Lys Ser Pb* 3sr Arg 
725 7S0 735 

Tyr Tyr Thr Asp Phe Lys Aid Ala Val Gin Ala Leu Ala Ala Leu Asp 
740 745 750 

Cys Leu His Ser Leu Ser Thr Leu Ser Arg Asn Lya Asn Tyr Val Arg 

760 765 

Pro Glu Phe Val Asp Asp Cye Glu Pro Val Glu He Asn He Gin Ser 
770 775 790 

Gly Arg His Pro Val Leu Glu Thr lie Leu Gin Aap Aan Phe val Pro 
7 &5 790 795 800 

Asn Asp Thr lie Lbu Hia Ala 01u Gly Glu Tyr Cys Gin He He Thr 

$QS 810 315 

Gly Pro Asn Met Gly Gly Lys Ser Cya Tyr He Arg Gin Val Ala Leu 

825 330* 



SDOCIO. <WO 9919A92A2TI > 



' WO 99/19492 PCT/BPW0OT77 

17 

lie Ser lie Mot Ala Gin Val Gly Ser Phe V*l Pro Ala Set Phe Ala 
B35 840 845 

Lys Lpu Hia Val Leu Asp Gly val Phe Thr Arg Met Gly Ala Ser asp 
950 955 860 

ser He Gin His Gly Axg ser Thr Phe Leu Gla Glu Leu Ser Glu Ala 
a6 5 3?0 875 *B0 

Ser Hla lie lis Arg Thr Cys Sar Ser Arg Ser Leu val He Leu Asp 
BBS 890 895 

Glu L*u Gly Arg Gly Tbr Ser Thr Mia Asp Gly Val Ala lie Ala Tyr 
900 9DS 910 

Ala Thr Leu Gin Hia Leu Leu Ala Glu Lys Arg Cys Leu Val Leu Phe 
915 920 * 25 

val Thr His Tyr Pro Glu He Ala Glu He Ser Aen Gly Phe Pro Gly 
?30 ?35 940 

■s~r Vftl G\y Thr Tyr His Val Ser Tyr be* Thr Leu Glr> Lya Asp Lys 

950 95* * 6C> 

Gly S«r Tyr Asp Bis Asp Asp V*l Thr Tyr Leu Tyr l,ya Leu Va) Arg 
965 9-70 

Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
930 9B& S&0 

He Pre Pro Ser- eye lie Ar 9 Arg Ala He Ser Met Ala Ala Lye Leu 
995 1000 1Dt>s 

Glu Ala Glu vsl Arg Ala Arg Glu Arg Asn Thr Arg Met Gl.y Glu Prt> 
iOiO 1015 L(>2,> 

Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu ser He ser Ala Leu 
1025 1030 1040 

Gly Asp L*u Pha Ala Aep Leu Lye Phe Ala Leu Ser Glu Glu Asp Pro 
1045 1050 10*5 

rrp Lye Ala Phe <3lU Pbe Leu Lys His Ala Trp Lys He Ala Gly Ly& 
10 g0 1065 1070 

He Arg Leu Lya Pro Thr cya Sex Phe 
1075 ioeo 



<210> 23 
<211> 24 
<212> £>NA 

Artificial Bequence 
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<220> 

<22l> specific primer 63 8 for PCR using coma of AraJbidopsie 

thalisna ecotype Columbia 



<400> 


20 




gcgacgaaaa accg 


<2l0=> 


21 


■c211> 


28 


<212> 


DMA 




Artificial sequence 


<220> 




<223> 


Pruner S31 for PCR 




Columbia 


<4D0> 


21 



24 



cgtcgccttt agcatccccC tcftttcac 2S 



<210> 
<2ll> 
<2\2> 
<2L3> 


22 
30 
DMA 

Art if I cial sequence 


*220> 
<223:> 


msh€! specific primer 5323 
the* liana ecotype Columbia 




22 


gcttggcgca 


c c t aa taga a t ca t ga csgg 


<210> 

<2tx> 

<212> 
<2l*> 


2* 
24 
DMA 

Artificial sequence 


<220> 
<223> 


MSH6 specific primer 637 
t ha liana ecotypfc Columbia 


<400> 


23 



30 



gacagogtsca gttctte*ga arge 24 



<210> 24 
c211> 35 
<212> DKTA 
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<213>> Artificial sequence 



<220> 



M£H(> specific primer l£S for FCR usia^ CDNA of Arabidopsi: 
thftliana ecotype Coluimbia 



ettccc^ggat gcagcgrcag aga tog ante, tgt 33 



<2l0> 25 

*:211> 2? 

<212> DMA 

<2i3* Artificial sequence 
<220* 

<223> M3H$ specific prinw.r SR3 for PCR uaing cDNA of Arabldopeia 
t ha Li cm* ef;C*typc Columbia 

c4GD> 25 



cgctatetat 


ggctgct teg 


aattga^r 








27 




2S 












<211> 














<212> 
















Arsb i dope i s tha liana 


. ecotype Columbia 






<223> 


ClCfi* 43 


























cccggg-atgc 


agegecagag 


atcgat^ttg 


tctttcttcc 


aaasacccac 


ggeggegact 


SO 


acgaagggtt 


tggtttccgg 


egatgetget 


ageggegggg 


geggcagegg 


aggaccacga 


120 


LCtaatgtga 


aggaagggga 


tgetaaagge 


gacgctcctg 


tacgttttgc 


tgtttcgaaa 


180 


tctgrcgatg 


aggttagagg 


aacggatact 


ccaccggaga 


aggttccgcg 


tcgtgtcctg 


240 


ccgcetggat 


ttaagcrggc 


tgaatccgrc 


ggegatgett 


cgtccctgtt 


ctccaatatc 


300 


atgcataagt 


ttgtaaaagt 


cgatgatcga 


gattgttctg gagagagga«g 


ccgagaagac 


360 


gttgtcccgc 


tgaatgattc 


atctctatgt 


atgaaggeca 




ccctcaattt 


420 


egtcccaata 


atggtaaaac 


tcaagastaga 


aacratget t 


ttagcttcag 


fcgggagagct 


480 


gaacctagat 


cagtagaaga 


tataggzvgta 


gatggcgatg 


ttcctggtcc 


aga a a caeca 


£40 


gggatgcgtc 


cacgtgcttc 


tegcttgaag 


cgagfctctgg 


aggatgaaat. 


gacttttaag 


£00 


■^aggataagg 


tcecegtatt 


ggactcxaac 


aasaggctga 


aaatgctccai 


ggatccggtt 


660 
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cgr.g^agaga 


<sgaaagaagt 


aaacgaegga 


acc&aat ttg 


aatggcctga 


gtcttctcga 


720 


atcagggatg 


ccaatagaag 


eicgtcctgaC 


gatccccctt 


acgatagaaa 


gaccttacae 


730 




atgtcttcaa 


gaaaatgtce 


gcatcac&aa 


agcaaeattg 


gagtgttaag 


840 


agtgaacata 


tggacattgt 


gccnttcctt 


aaagtgggga 


asttttatga 


gctgtatgag 


9 DO 


etagatgegg 


aattaggtca 


caaggagctt 


gactggaaga 


tgaccatgag 


tggtgtggga 


36D 




aggttggtat 


ctctgaaagt 


gggatagatg 


aggcagtgea 


aaagctatta 


1020 


gcccgtggat 


ataaagttgg 


aogaatcgag 


cagctagaas 


cacctgacca 


ageaaaagee 


1030 


agagg tgeca 


atactica at 


cccaaggaag 


ctagttcagg 


tatcaatftco 


atcaacagca 


1140 


agegagggaa 


acateggger 


r.qatgccgt.c 


catcctctfcg 


ctaca&aaga 


gaccaaaatg 


12D0 


gagctacaaa 


agtgtecaac 


tgtgtatgga 


t ctgcttttg 


ttgactgtgc 


tgecttgagg 


1260 


tt ttgggt tg 


ggtctutcag 


rjgar.gar.^t:^ 


Lcatgtgocg 


cccteggagc 


gttatrzgacg 


1320 




caaaggaagt 


gttatat^ac 


agcaaagggc 


tatcaagaga 


agcacaaaag 


13SO 


gceccaagga 


aatatacgee 


gacagggncc 


acggcggcac 


agtbggctcc 


acjcaccacaa 


1440 


gtaatggc>gg 


atacagatgc 


tgctggagtt 


agaaatataa 


tagaatctaa 


eggataettt 


1500 


flAAggccct v 


ciigaa&catg 


gaactgeget 


gttgatggtc 


taaadgaatg 


tgatgttgcc 


1550 


cUag;gctc 


t-Cggaga-sot 


astcaatcat 


ctgtctaggc 


taaagctaga 


agatgtacct 


1520 


aagcatgggg 


atatctctcc 


ataccaagst 


racaggggtt 


gectcagaat 


cgatggccag 


1680 


aegaeggtaa 


ascttgagat 


atttaacaac 


agctgtgacg 


gtggtCCfctc 


agggaccttg 


1740 


tacaaatatc 


ttgataactg 


tgttagtcca 


actggtaagc 


gaccctcaag 


gaattggatc 


1BD0 


tgccatccac 


tcaaagatgt 


sgaaagcatc 


aataaaegge 


ttgatgeage 


tgaagaattc 


1B60 


aeggcaaact 


cagaaagtat 


gcaAAtcacrt 


ggecagtate 


tccacaaact 


tccagaccta 


1920 


gaaagactgc 


teggaegcat 


caagtctagc 


gttegatcat 


cagcctctgt 


gttgectget 


19&0 


ettctgggga 


aaaaagtgct 


gaaaeaacga 


gttaaagcat 


ttgggcaaat 


tgrgaaaggg 


2040 


ttcagaageg 


gaattgafcee 


gttgeeggct 


ctacagaagg 


aaccaaacat 


gatgagtttg 


2100 


ctctataaac 


tctgfcaaact 


tcctaeatta 


gtaggaaaaa 


gegggctaga 


gttatttctc 


2160 


tctcaattcg 


aagcagecat 


agat3gcg 








2193 
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<2ll> 1*85 

<212> DMA 

<2i3p- Arabi^opBiB t ha liana ecotype Columbia 

<223> ClOrtQ 62 

<400> 27 



catcagcctc tgtgttgcct gctcttctgg ggaaaaaagt gctgaaacaa cgagttaaag 60 

-CAtCtgggca aattgtgaaa gggttcagaa gtggaattga tctgttgttg gctctacaga 120 

aggaateaaa tatgatgagt ttgcxetata aactctgtaa acttcecata ccagtaggaa 130 

aaagcgggcC agagtcatet ctttcccaatt tcgaagcagc catagatagc gactttcc&a 240 

actatcagaa ccaagatgtg acagetgaaa acgctgaaac tctcacsata cttatcgaac 300 

tttttatega aagagcaact eaatggtctg aggtcac^ca caccataagc cgectagatg 36 0 

tccxgagatc CCCtgcaatc gcagcaagtc tcC^tgctgg aaqcacggcc aggcctgtrr.^ 43 D 

Lttttrccga atcagaagct acagatcaga £ccagaa&ac aaaagggcca atacttaaaa 4 B0 

tccaaggact. a egg cat cc a tttgcagccg eagccgotgg ccaactgcct gttccgaatg 54 G 

atitactcct tggcgaggct agaagasgca gtggcagc3t teatccccgg tcactgttac 6 00 
tgacggga.cc aaacaegggc ggaa&atcaa otectettcg tgcaacatgt ctggccgtta 

tccctgccca acttggctgc tscgtgccgc gtgagtcttg cg&aAtCtcc ctcgtggata 7 20 

ctaccctcao aaggcttggc geatctgata gaatcacgac aqgagagagn ^ccttttrgg 7 B0 

tagaatgcac cgagaeagcg tcagttcttc ag&atgcoac tcaggsttcs ctagtaatcc B40 

tt^acgaact gggcagagga actagtactt tcg&fcgga&a cgccattgca tactcggttt 9 00 

tfccgtcacct ggtagagaaa gttc^atgtc ggatgc*ctt cgcaac&cat taccacecrc 96 0 

tcsccaagga attcgcgtcc cacccacgtg tcaccccgaa acacatggct tgegcattca 102 0 

aatcaagatc tgafctatcaa ccacgtggtt gtgatcaaga cctagtgttc ttgtaccgtt 108 0 

taacqgaggg agcetgtcct gagagctacg gaettcaagt ggcactcatg gctggaatac 114 0 

caaaccaagt ggttgaaaca gcatcaggfcg ctgctcaagc catgaagaga tcaattgggg 120 0 

aaaactteaa gtsaagtgag etaagatctg agtit.et.caag trtgcatgaa gactggctca 1260 

agccattggt gggtatttet cgagtcgccc acaacaatgc ccccattggc gaagatgact 13 2 0 

acgacacttfc gttctgctca tggcatgaga tcaaatcctc ttactgtgtt cccaaataac 13 9 0 
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ccggg 




<210> 
<211> 
<212> 
c2l3i. 


23 
34 
DNA 

Artificial sequence 


<220> 
<223> 


KSK6 specific primer 233 ] 
tha liana ecotype Columbia 


<4O0> 




atccogggtt 


atttgggaac acagtaagag gatt 


<210> 
<211> 
<212> 
<2l3> 


29 
27 
DNA 

Artificial sequence 


<22C> 
c223> 


MSK6 specific primer S3? 
thalian* ecotyp* Columbia 




59 



3* 



gcgtfccgatc atcagcctct gtgttgc 27 



<210> 


30 


<211> 


3605 


c2L2> 


DMA 


<213> 


AraJbicfopsis t^a liana eeotypp 


<220> 




<221> 


CDS 


<222> 


(142) .... (3468) 


<223> 


AtMSHS full-length cDNA and i 




polypeptide 




*0 



saaagttgag ccctgaggag tatcgtttcc gccatttcta cgacgcaagg cgaaaatttt 60 

tggcgccaat ccttccccee tttcgaatce ectcagctea aaacatcgtt tctctctcac 120 

tctctctcac aattccaaaa a atg cag cgc cag aga teg att ttg tct ttc 171 

Met Gin Arg Gin Arg Ser Ue uu Ser Phe 
15 10 
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ttc caa aaa ccc acc gcg gcg act acg aag ggt ttg gtt tec ggc gat 213 
Phe Gin Lye fro Thr Ala Ala Thr Thr Lys Gly Leu Val Ser Gly Asp 

15 20 25 

get get age ggc ggg ggc ggc age gga ggs eca cga ret aat gtg aag 257 
Ala Ala Ser Gly Gly Gly Gly Ser Gly Giy Pro Arg Phe Asn Val Arg 
30 55 -40 

gaa ggg gat get aaa ggc gac get tct gta cgt ttt get gtt teg aaa 315 
Olu Gly Aap Ala Lye Gly Asp Ala Ser Val Arg Pha Ala Val Ser Lye 
45 SO 55 

tct gtc gat gag gtt aga gga acg gat act cca ccg gag aag gtt ccg 363 
Ser Val Asp Glu Val Arg Gly Thr Asp Thr Pro Pro Glu Lya Val Pro 
6$ 65 70 

cgt cgt gtc ctg ccg tct gga ttt aag ccg get gaa tec gec gsfc gat 411 
Arg Arg Val Lou Pro S&r Gly Phe Lys Pro Ala Glu Ser Ala Gly Aap 
75 SO SS 90 

get teg tec ctg tCO tec aac ett atg cat aag tct gta aaa gtc gat 459 
Ala S^r ser Leu Phc Ser *r.r. ; 1* k^c: His Ly& Pbe Val Lye Val Asp 

1DO 105 

g$c cga gat tgt tcz gga gag agg age cga gaa gac gtt gtt ccg ctg 50? 
Asp Arg Aep cys Ser Gly Glu Arg Bar Arg Glu Asp Val Val Pro Leu 
110 115 120 

sat gat. tea tct cca tgt atg sag get a&t gat gcr. att cct caa tec 5SS 
Asn Aep Ser Ser Leu Cys Met. Lys Ala Aan Asp Val lie Pro Gin Phe 
125 130 135 

cgt tec aat aat ggt aaa act caa gaa aga auC cat get ttt age ttc 603 
Arg Ser Asa Asn Giy Lys Thr Sin Glu Arg abji His Ala Phe Ser Phe 
140 145 150 

*gt ggg aga get gaa ett aga tea gta gaa gat ata gga gta gat ggc 6 51 

Ser Gly Arg Ala Glu Leu Arg Ser Val Glu Asp lift Gly Val Aap Gly 
155 160 165 170 



gat gtt cct ggt cca gaa aca cca ggg atg cgt cc3 cgt get tct cgc 
Asp val Pro Gly Pro Glu Thr Pro Gly Mat Arg Pro Arg Ala Ser Arg 
175 160 1S5 



659 



ttg asg cga gtt ctg gag gat gaa atg act ttt aag gag gat aag gtt 74 7 

Leu Lys Arg val Leu Glu Asp Glu wet Thr Phe Lys Glu Asp Lys Val 
190 1S>5 200 

cct gta ttg gac tct aac aaa agg ctg aaa atg etc cag gat ccg gtt 
Pro Val L«u Aep Ser Asrt Lys Arg Leu Lya Met Leu Gin A&p Pro Val 
205 21D 215 
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t^t gga gag <iaq aaa gaa gta aac §aa gga acc aaa ttt gart tgg ctt 043 
Cya Gly ''Jiu Ly* Lye Glu Val At?x> Glu Gly Thr Lye Phe Glu Trp Leu 
220 225 230 

gag tec tct cga acc agg gat gec aat aga aga cgt ccc gat gat ccc 391 
Glu ser &er Arg lie Arg Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro 
235 240 245 250 

etc tac gat aga aag acc tea cac ata cca cct gat get ttc aag aaa 93 9 

Leu Tyr Asp Arg Lys Thr Lau Bis He Pro Pro Asp Val Ph© Lye Lys 
25S 260 265 

atg tct gca tea caa aag caa tat tgg age gtt aag agt gaa sat atg 987 
Ket Ser Ala Ser Gin Lys Gin Tyr Trp Ser Val Lys Ser Glu Tyr Mat 

270 275 2S0 

gac att gtg ctt ttc ttt aaa gtg ggg aaa ttt tat gag ccg tat gag 103 5 

Kep lie Val Leu phe Phe Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu 

2&5 2DO 295 

eta gat gc^ gaa eta ggt cac «»y 9^3 c t£ 9 ac tgg stag Atg acc atg 10B3 
J-eu Asp Ala GIxl Leu rsly Kis Lys Ola Leu Asp Trp Lys Met Thr West 
3 0 3 .305 310 

agt ggt gvg 99* aaa tgc aga t*ag gt?; ggt ate tct gaa agt ggg ata 1131 
Sex Gly V*l Gly Lys Cys Arg Gin Val Gly lift 5er Glu Ser Gly He 
315 320 325 330 

gat gag gca grg caa aag ctfi eta gec cg^ gga tat aaa gtt gga cga 1179 
Agp Glu Al=i Val Gill Lys Leu Leu Ala Arg Gly Tyx Lys Val Gly Arg 
535 34$ 345 

ate gag cai ctfis gaa aca tct gac caa gca aaa gec aga ggt get aat 12 2*? 

He Glu Gin Leu Glu Thr Ser Asp Gin Ala Lys Ala Arg Gly Ala ASA 
350 35b 360 

act ata att cca agg aag eta gtt cag gta tta act cca tea aca gca 1275 
Thx lie lie Pro Arg Lys Lau Val Gift Val Leu Thr Pro Sar Thr Ala 
355 370 3*?5 

age gag gga aac? ate ggg cct gat gec gtc cat ctt etc get ata aaa 1323 
S«r Glu Gly abj> rie Gly Pro Asp Ala Val His Lbu Lou Ala He Lys 
380 IBS 390 

gag ate aaa atg gag eta caa aag cgt tea act gtg tat gga ttt grct 1371 
Glu He Lya Met Glu Leu Gla Lys Cys Ser Thr Val Tyr Oly Phe Ala 
395 4G0 405 410 

ttt gtc gac tgc gec gec ttg agg ttt tgg gtc ggg tec ate age gat 1419 
Phe Val Asp CyB Ala Ala Leu Arg phe Trp Val Gly Sec lie Ser Aap 
415 420 425 
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9AC gcs tea tgt get gen ctt gga gcg tta ttg atg cag gtt tct cca 1467 

Asp Ala Ser Cya ALA Ma Leu Gly Ala Leu Leu Met Gin Val Sat Pro 
430 43S 440 

aag gaa gtg tts tac gac age aaa ggg eta tea aga gaa gca caa aag I51S 
Dy b Glu Val Leu Tyr Asp S«r Lya Gly Leu SCr Arg Glu Ala Gin Lys 
445 450 455 

get eta agg aaa tat acg ctg aca ggg tct acg gcg gta cag ttg get 156 3 
Ala Leu Arg Lya Tyr Thr Leu Tbr Gly Ber Thr Ala Val Gin Lbu Ala 
460 455 470 

cca gta cca caa gta atg ggg gat aca gat get get gga gtt aga aat 1611 
Pro val Pro Cln Val Met Gly Asp Tbr Asp Ala Ala Gly Val Arg Aen 
475 430 4$5 490 

ata ata gaa tcb aac gga cac ttt aaa ggt tct tct gaa tea tgg aac ifiS9 
lie lift Glu ier Asn Gly Tyr Phc Lys G\y Ser ser Glu Ser Trp Asn 

<t$S 500 SOS 

tgt gec gtt gat ggt eta aat gaa tgz> g3t get. gec ctt agt get ctt 170V 
Cys Ala Val Asp Gly Leu Asr. Glu Cys Asp Val Ala L.eu Ser Ala Leu 
f.10 Si 5 520 

gga gag eta act acn; cat ctg tct, agg Cta aag eta gaa gat gta ctt- 1755 
Gly Glu Lev He Abii His Leu Ser Arg Leu Lya Le»j Glu Asp Val Leu 
525 550 



aag cat ggg gat att ttt cca tac caa gtt tac agg ggt tgt etc aga 
Lye Hi 5 Gly Asp lie Phe Pro Tyr Gin Val Tyr Arg Gly Cya Leu Arg 
540 545 5-50 



gat ggt ggc ccfc tea ggg ace ttg tac aaa tac ctt gat aac tgt gtt 
Asp Gly Gly Pro 3* r Gly Thr Lbu Tyr Lye Tyr Leu Asp Asn Cys Val 
$75 5S0 535 



iao3 



act gat ggc cag aog &tg gta aac ccc gag ata tt: aac aat age cgt lbSi 
lie Asp Gly Gin Thr Mat val Asn Leu Glu lie Pha abii hsn ser Cys 
555 SfiO 565 570 



1B99 



age cca act ggt aag cga etc tta agg aat tgg ate cgc cat cca etc 1947 
eer Pro Thr Gly Lya Arg Lau Leu Arg abii Trp lie Cys H±b Pro Lau 
590 BBS 600 

aaa gat gta gaa age ate aat aaa egg ctt gat gca gtc gaa gaa etc 1995 
Lya Asp Val Glu Ser 11* A*n Lys Arg Leu Aap Val Val Glu Glu Pha 
605 610 615 

aeg gca aac tea gaa agt atg caa ate act ggc cag tat etc cac aaa 204 3 

Thr Ala Asn Ser Glu 6 r ?*et cln He Thr Gly Gin Tyr Lbu His Lys 
620 625 630 
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ctt cca gac 
Ueu Pro ABp 

635 

tea tea g cc 
S^r ser Ala 



caa cga gtt 
Gin Arg Val 



att gat ctg 

I la Asp Leu 
663 

ctt tat aa* 

Leu Tyx Lya 

gag tta ctt 

Glu Leu Phft 
715 

aat tat cag 

Aen Tyr nln 



ata ctt ate 
jle Leu lie 



att cac acc 
lie His Thr 
765 

gca agt etc 
Ala Ser Leu 

tea gaa get 
Ser Glu Ala 

79$ 

ate caa gqa 
He Gin Gly 



cct gtt ccg 
Pro Val Pro 



tta gaa aga 
Leu «u Arg 

tct gtg ttg 
ser val Leu 
£55 

Ada gca ttt 
Ly* Ala Phe 
670 

ttg ttg get 
Leu LdU Ala 



etc tgt aaa 
Leu Cys Lys 



ctt tct caa 
Lr>\j Sftr Gin 
720 

aae caa gat 
A*» Gin ABp 
73S 

gaa ctt ttt 
Glu L-eu Phe 
750 

ata age tge 
lie Ser cya 



tct get gga 
Ser Ala Gly 



aca gat cag 
Thr A ep Gin 
300 

eta tgg cat 
Leu Trp His 
eis 

aat gat ata 
Asn Asp He 
930 



etg etc gga 
Leu Leu Gly 



cct get ctt 
Pro Ala Leu 



ggg caa att 
Gly Gin lie 

675 

eta cag aag 
Leu Gin Lys 

ctt cct ata 
Lgu Pro lie 
70S 

ttc Qaa gca 
fc>he Glu Ala 



gtg aca gat 
Val Thr Asp 



ate gaa aga 
lie GLu Arg 
755 

eta gat gtc 
Leu Asp val 
770 

age atg gee 
eer Mat Ala 

785 

aat cag aaa 
Aen Gin Lys 



cca ttt gca 
Pre* Phe Ala 



etc ctt ggc 
Leu Leu Gly 

635 



26 

ego ate sag 
Arg tie Lys 

etg ggg aaa 
Leu Gly Lys 
£60 

gtg aaa ggg 
Val Lys Gly 



gaa tea aat 
Glu eer Asn 



tta gca gga 
Leu Val Gly 
7in 

gee ata gac 
Ala lie Asp 

gaa aae get 
Glu Aan Ala 
740 

gca act caa 
Ala Thr Gin 



ctg aga tct 
Leu Arg ser 



agg cct gtt 
Arg Pro Val 
790 

aca aaa ggg 
Thr LyB Gly 
605 

gtt gca gec 
Val Ala Ala 
830 

gag get aga 
Glu Ala Arg 



tct age gtt 
Ser Ser Val 



aaa gtg ctg 
Lys val Leu 

ttc aga agt 
Phe Arg Ser 
S30 

atg atg agt 
Met Met ser 

aaa age ggg 
Lys Ser Gly 



age gac ttt 
ser Asp Phe 



gaa act etc 
Glu Thr Leu 

745 

tgg tct gag 
Trp Ser Glu 
7£f> 

ttt gca ate 
Phe Ala lie 
775 

att tct ccc 
lie Phe Pro 



cca ata ctt 
Pro He Leu 



gat ggc caa 
Asp Gly Gin 
B25 

aga age agt 
Arg Ser Ser 



eg a 2Q91 
Arg 

SS0 

aaa 213 9 

Lys 



gga 2137 
Gly 



ttg 2255 
Leu 



Cta 2283 
Leu 



cca 2 531 

Pro 

730 

aca 2379 
Thr 



gtc 2427 
val 



gca 2475 
Ala 



gaa 2523 
Glu 



aaa 3571 

Lys 

&10 

ttg 2$i9 

Leu 



ggc 2667 
Gly 
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age att cat cct egs tea ctg tcs ctg acg gga cca aac atg ggc gga 2715 
Ser lie Hie Pro Arg Ser Leu L*u Leu Thr Gly Fro Asn Met Gly Gly 

BSD 

saa tea act etc etc cgt gca aca tgt ctg gec gtt ate ttc gec caa *7fi$ 
Lys ser Tbr Leu Leu Arg Ala Thr Cys L*u Ala Val lie Phe Ala Gin 
860 S65 6*70 

ctt ggc tgc tac gtg ccg tgt gag tct tgc gaa ate tec etc gtg gat 2S11 
Lea Gly Cys Tyr Val Pro CyB Glu Ser Cye Olu He Ser Leo Val Asp 
875 390 B35 S^O 

act ate ttc aca agg ctt ggc gca tec gat aga ate atg aca gga gag 2B5 9 
Thr He Phe Tbr Arg Leu Gly Ala Ser Asp Arg 3 Id Mftt Thr Gly Glu 

900 90S 

age acc ttt ttg gta gaa tgc act gag aca gcg tea gtt ctt cag aat 
Ser Thr Phe LcU Val Glu Cys Thr Glu Thr Ala Ser Val Leu Gin Asn 
310 915 «0 

gca act cag gat tea <:ta gta ar.c cmlc gac <?a3 ccg ggc aga gga. act 
Ma Thr Gin Asp Ser v<ti I : ts Leu Asp Glu Lev oly Arg Gly 'Thr 

92S ^30 935 

agt act ttc gat gga cac gec acc 90a tac teg get ttt cgt cac ccg 
Ser Thr Phe Asp Gly Tyr Ala lie Ala Tyr Ser Val Phe Arg His Leu 
940 54S 9S0 



gta gag aaa get eaa tgt egg atg crt.c ttt gca aca cat tac e^c cct 
val Glu Lya Val Gin CyS Arg Met Leu Pbe Als Thv His Tyx Hi b t>ro 
955 ' 960 96S 970 



2907 



30S1 



etc acc aag gaa ttc gcg tct cac cca cg£ gtr acc teg aaa cac atg 
h&u Tbr Lye Glu Phe Ala Ser His Pro Arg Val Thr Ser Lye His Met 

9-75 seo 9ss 

get tgc gca ttc aaa tea aga tct gat tat caa cca cgt ggt tgt gat 3147 
Ala Cys Ala Phs Lys Ser Arg Sec Asp Tyr Gin Pro Arg Gly Cys Asp 
$$0 995 1000 

caa gac eta gtg ttc ttg tac cgt tta acc gag gga get tgc cct gag 3195 
Gin A*p Leu Val Phe Lea Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu 
LOOS 1010 1015 

age tac gga ctt caa gtg gca etc atg get sga ata cca aac caa gtg 3243 
Ser Tyr Gly Leu Gin Val Ala Leu Met Ala Gly lie Pro Asn Gin val 
102D 1025 1030 

gtt gaa aca gca tea ggt gec get caa gec atg aag aga tea att ggg 32 9i 

Val Glu Thr Ala Ser Gly Ala Ala Gin Ala H6t Lya Arg Ser He Gly 
1035 X04D 1045 1050 
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2e » • 

gga anc ttc aag tea agt gag eta aga tct gag ttc tea agt ctg oat 33 3 9 

Glu As>n Phe Lys Ser Ser Glvi beu Arg ser Glu Phe S«r Ser Leu His 
iOSS L06D i065 

gaa gac tgg etc aag cca ttg gt 3 ggt att tct cga gtc gec cac aac 3387 
Glu Asp Trp Leu Lye Ser Leu Val Gly rift Sor Arg Val Ala His Affn 
1070 1075 10BD 

aaC gec ccc att ggc gaa gat gac Cac gac act ttg ttt tgc fcta tgs 343 5 

Am Ala Pro Il& Gly Glu Asp Asp Tyr A9p Thr Leu Phe eye teu Trp 
10*5 1090 1€95 

cat gag ate aaa ccc tct tac tgt gtt ccc aaa taaatggcta 3478 
Kts Glu Tie Lys Ser Ser Tyr Cys Val Pro Lya 
1100 1105 

tgacataaca ctacctgaag ctcgttaagt cttctgcctc tctgatgttt attectccta 3 53 8 
aaaaatgett atatatcaaa uaactis^v;. etcgactaaa aaaaaaaaaa aaaaaaaaaa 759ft 
aaaaaaaok 360$ 



<210> 31 

<2ll> 1109 

<212j> PRT 

*£13> Arsi>i cfop^is eyiet liana ecotype Columbia 

<223> Polypeptide MSH6 

<400> 3i 

Met Gin Arg Gin Arg ser lie Leu Ser Phe Phe Gin Lye Pro Thr Ala 

Ala Thr Tbr Lys Gly Leu val Ser Gly abp Ala Ala Ser Gly Gly Gly 
20 25 30 

Gly ser Gly Gly Pro Arg Pbe Asn Val Arg Glu Gly Asp Ala Lys Gly 

40 45 

Asp Ala s*r Val Arg Phe Ala val Ber Lys ser val Asp Clu v*l Arg 
50 55 50 

Gly Thr Asp Thr Pro Pro Glu JUys Val Pro Aarg Axg Val Leu Pro 9*r 
65 70 7S go 

Gly Ph* Lys Pro Ala Gin Ser Ala Gly Asp Ala Ser Ser Leu Phe Ser 

35 90 95 

Asn lie Met Hia Lys Phe Val Lya Val Asp Asp Arg Asp Cys Sfcr Gly 

105 110 

Glu Arg Ser Axg Glu Asp Val val Pro Leu Asn Aap ser ser Leu Cya 
115 120 125 



WOW/19492 



29 



Met U'/s Ala Asn Asp val He Pro Gin Phe Arg Ser Asn Aaa Gly Lys 
130 135 140 

Tbr Gin Glu Arg Asn His Ala Phe Ser Phe Ser Gly Arg Ala Glu Leu 
145 ISO 1S5 XSD 

Arg Ser val Glu Asp He Gly val Asp Gly Asp val Pxo Gly Pro Glu 
26S 17D 175 

Thr Pro Gly Met Arg Pro Arg Ala S«r Arg Leu Lys Arg Val Leu Glu 
180 LBS 190 

Aep Glu Met Thr Phe Lys Glu Asp Lyfc Val Pro Val Le\* Asp Ser Asn 
1$S 200 205 

^ys Arg Leu Lys Met Leu Gin Asp Pro Val Cys <31y Glu Lys Lye Glni 
210 215 220 

Val Asn Glu Gly Thr Lys Pbe Giu Trp Leu Clu Scr Ser Axg He Arg 
225 230 23S 240 

Asd Ala A/*D Arg Arg Pro Asp Aop Pro Lev Tyr Asp Arg LyU Thr 

245 2S0 255 

Lou KJ5 He Pro Pro Asp Val Phe Ly* Lys Met Ser Ala Scr Gin Lye 
2SO 265 270 

Gin Tyr Trp Ser Val Lye Ser Glu Tyr Met Asp He Val Leu Phe Phe 
?.75 7BQ 285 

Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu Lbu Asp Ala Glu Leu Gly 
29D 295 3O0 

Hi b Lys Glu Uu Asp Trp Lys Met Thr Met Scr Gly Val Gly Lya Cys 
30* 310 315 320 

Arg Gin Val Gly lie Ser Glu Ser Gly He Aap Glu Ala Val Gin Lya 
32S 330 335 

Leu Leu Ala Arg Gly Tyar Lys val Gly Arg Tie Glu Gin leu Glu Thr 
340 345 350 

Ser Asp Gift Ala Lyo- Ala Arg Gly Ala Aati Thr II© He Pro Arg Lya 
355 360 3€5 

Leu Val Gin Val Le-U Thr Pro Ser Thr Ala Ser Glu Gly Adrl lie Gly 
370 375 3B0 

Pro Aep Ala Val Hie Leu Leu Ala 11a Lys Glu He Lys Met Glu Leu 
3BS 390 395 400 

Gin Lya CyB Ser Thr val Tyr Gly Phe Ala Phe Val Asp Cys Ala Ala 
405 410 415 



gcammrv «-wo ooiojooa-tti 
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Leu Arg Phe Trp val Gly Ser lie Ser ab P Asp Ala Bar Cys Ala Ala 
420 425 430 

Leu Gly Ala Leu Leu Met Gin val Ser Pro Lye Glu Val Leu Tyr Asp 
43S 440 44S 

Sec Lys Gly Leu Ser Arg Glu Ala Gin Lys Ala Leu Arg Lya Tyr Thx 
450 455 450 

- L«u Thr Gly Ser Thr Ala Val Gin Leu Ala Pro Val Pro Gin Vel Met 
46S 470 475 480 

Gly Aap Thr Asp Ala Ala Gly Val Arg Asn lie lie Glu Ser AST! Gly 
465 450 495 

Tyr Phe Lys Cly Ser Bfir Glu Ser Trp AS11 Cya Ala Val Asp Gly Leu 
500 505 510 

Aan Glu Cys Asp Val Ala Leu Ser Ala Leu Gly Glu Leu He Asn His 
515 520 525 

Leu Ser Arg L*t* Lys Leu Glu Asp Val Leu Lys His Gly Asp I2e PV- 
"° 535 5(40 

PVO Tyr Gin Val Tyr Arg Gly Cys Leu Arg II© Asp Gly Gin Thr Met 
545 550 S55 56D 

Vel AEtn Lew Glu lie Phe Aati Asn ser Cyn Asp CSly nj.y Pro Ser Gly 
565 570 575 

Thr Leu Tyr Lys Tyr Leu Asp Asn Cya val Ser Fro Thr Gly Lyo Arg 
S£t0 585 590 

Leu Leu Arg Asn Trp He Cys Hie Pro Leu Lya Aep val Glu Ser lie 
S9 5 S00 €05 

Asn Lya Arg Leu Aep Val Val Glu clu Pbe Thr Ala Asn ser Glu Ser 
* 10 615 62 0 

Met cin I la Thr Gly Gin Tyr His Lya Leu Pro Asp Leu Glu Arg 
625 «30 SJS 640 

Leu Leu Gly Arg He Lys Ser 6er Val Arg Se-r Ser Ala ser val Leu 
645 650 6S5 

Pro Ala Leu Leu Gly Lys Lya Val Leu Lya Gin Arg Val Lya Ala Phe 
660 665 670 

Gly Gin I la Val Lys Gly Ph© Arg Ser Gly U Asp Leu fceu Leu Ala 
* 7S 680 SB 5 

Leu Gin Lys Glu Ser Aan Met wet ser Leu Leu Tyr Lya Leu Cys Lys 
59° 695 700 
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Leu Pro He Leu Val Gly Lys Ser Gly Leu Glu Leu Phe Leu Ser Gin 
705 7*0 715 7 20 

Phe Glu Ala Ala He Asp Ser ASp Phe Pro Asn Tyr Girt Asn Gin Asp 
725 -?SO 735 

val Tbx Asp Glu Asn Ala Glu Thr l^eu Thr lie Leu He Glu Leu Phe 
740 745 750 

lie Glu Arg Ala Thr Gin Trp Ser Glu Val tie Kia Thr Tie Ser Cys 
755 760 765 

Lbu Asp Vul Leu Arg ser phe Ala He Ala Ala ser Leu Ser Ala Gly 

770 775 730 

Ser Met Ala Arg Pro val He Phe Pro Glu Ser Glu Ala Thr Aep Gin 
795 790 795 800 

Asn Gin Lys Thr Lys Gly Prr> Tie Leu Lyc lie Gin Gly Leu rrp Mis 
805 810 815 

Pro Phi? Al-fr val Ala Ala Asp Sly Gin Lau Pro v«sl Pro Asn Asp lie 
B20 B25 B30 

Leu leu Gly Glu Ala Arg Arg Ser Ser Gly Ser lie His Pro Arg Ser 
BJ5 64D 645 

Leu Leu Leu Thr Gly Pro Asn Met Gly Cly Lys S<sr Thr Le\» Leu Arg 
£5D BSS 860 

Ala Thr Cy& LtsU Aid. Val lie Phe Ala Gill [j'exi Gly Cyg Tyr Val Pro 
365 870 675 B3Q 

Cys Glu Ser Cys olu lie ser Leu val Asp Thr lie Phe Thr Arg Leu 
BBS £30 895 

Gly Ala Ser Asp Arg lie Met Thr Gly Glu Ser Thr Phe- LGU Val Glu 
$00 $05 $10 

Cys Thr Glu Thr Ala Ser Val Leu Sin Ala Thr Gin Asp Ser Leu 

915 920 925 

Val He Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr Phe Asp Gly Tyr 
£30 335 940 

Ala lie Ala Tyx Ser Val Phe Arg His Leu Val Glu Lya Val Gin Cys 
945 950 9*5 960 

Arg Met Leu Phe Ala Thr Mis Tyr Kia Pro Leu Thr Lya Glu Phe Ala 
965 970 975 



Ser Kis Pro Arg Val Thr Ser Lys Kis Ket Ala Cys Ala Phe Lys Ser 
$B0 985 990 
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Arg Ser Asp Tyr Gin Pro Ar$ Gly Cys Asp Gin Asp J.«u Val Phe Leu 
99S 101?O IOOS 

Tyr Arg Leu Tbr Glu Gly Ala Cys Pro Glu 5e>: Tyr Gly Leu Gin Val 
10LO 1015 1020 

Ala L*u Met Ala Gly Tie Pro Asn Gin Val VaI Glu Thr Ala Ser Cly 
1025 10*0 1035 1040 

Al* Ala Gin Ala wet Lys Arg Ser He Gly Glu Asn Phe I>y& Ser Ser 
1045 10SO 1055 

Glu Leu Arg Sfcr Glu Phe Ser S^x Leu Bis Glu Aap Trp Leu Lya StbT 
1O60 1065 1070 

L&u Val Gly lie Ser Arg val Ala His Asn Asa Ala Pro Tie Gly Glu 

1075 1 OB D 1065 

Asp Asp Tyr Asp Thr L*u Phe Cys Leu Trp His Glu lie Lys> Ser Ser 
1090 1095 HOD ..v— -• 

Tyr Cys Val Pit> J.ys 
1105 



<210> 32 

<211> 24 

<212> D>1A 

<2i3> Artificial sequence. 
<220> 

<223> Forward primer for pcfc amplifj cation ot ATKGEtfEA 
microBatellitc 

<400> 32 



accstgcata gettaaactt cttg 24 



<210> 33 

<211> 22 

<2I2> DBA 

<213> Artificial sequence 

«220> 

<223> Reverse primer for PCR amplification of ATHGEN6A 
mi croa&tellite 

<400> 



acataaocac aaataggggt $c 22 



=;nnrtr> ^wn oo«o<»«y->»-rrt _ 
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<210> 34 

<2ll> IB 

< 2 1 2 > C'WA 

-.z>\2> * Artificial sequence 

<22D> 

<22 3> Forward primer DMCIn-a for pcr on genomic DNA of Arabidopsi s 
thai L ana esp, Land&berg eracta *Ler" 

<400> 34 

gaagcgatat sqtxcgtg 18 

Oio» as 

<212> DMA 

c2ia> Artificial sequence 

<220> 

<225> Reverse prima* DMCIN-B for PCR on genomic dna of Arabitfopsis 
thaliana ssp. Lands berg erects "Ler" 

<4D0> 35 

agatcgcgag aacartcc 19 

<21Q> IS 

<21i> 31 

<212> D«A 

<-2i5> Artificial sequence 
<220> 

<223> Forward primer DMCIN-l for PCR on genomic dna of ArabidopBis 
tha liana &sp« Lands berg erecta "Ler" 

c*DO> 36 

acgcgtcgac tcagctatga gattactcgt g 31 

<210> 37 

<2L1> 29 

<212> DNA 

*213> Artificial sequence 

<220> 

*:23J> Reverse primer DMCIM-2 fox PCR on genomic DNA of Arabidopeia 
ttiAliana asp. Landsberg ere^ta "Ler" 

<400> 37 

gctctagatc tcccgctcta agactctct 2 9 
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<2tl> 32 
<212> t>NA 

--213> Artificial sequence 

<220> 

<223> Forward primer DMCINO for PGR on genomic dwa of Arabidopeia 

tbaliana Bap, Landeberg e recta "lier a 

gctctagagc ttctcttaag taagtqattg at * 32 

<210> 39 
<211> 48 
<212> DMA 

«2J3> Artificial sequence 

<223> Heverse primer DMCIN-4 for VCt< on genomic DMA of Aratbidops i * 

r,*iAliar,a sBp. .Jandsberg erecta " Ler M 

<40Dv 59 

tccccc^cfijix x,oq&$*<jax*ci tccarggrt^. cttcagctct at.ga3tcc 4 8 

^2ilr> 26 

<213> Artificial sequence 

<22D> 

<223> Forward primer DMCla for PGR on genomic ONA of Arabidopeis 

theliana asp. Landeberg erect a "lier" 

acgcgtcgac gaattcgcaa gtgggg 2 6 

<213> 41 

<211> 36 

<2\Z > rWA 

<213> Artificial eequ&ace 
<220> 

c223:> Reveree primer DMCib for PC* on genomic DNA of Arabidopfiis 
thaliane sap, L.andaberg ere eta "Ler' 1 

<400> 41 



"XTC!I> <WO OQ1tUQ9A9TI -v 



.35 

cccatggaga tctcccgggt accgAtc ttcgaggg 3 6 

*210> 42 

<212> 20 

<212> DNA 

<213> Artificial sequence 
*22Q> 

<225> Forward primer for PCR amplif i cat ion of ATbati SSLP marker in 
Arabidopsis t ha Liana eubapeciee 

<400> 42 

gccaccgcgt gaatgatatg 20 

<2L0i- 43 

<21*> 22 

<212> DMA 

<213> Artificial aequejnce 
<22Q> 

<223> Reverse primer for PCR ampl i f i c a C i On of ATEATi SSLP marker in 
Arab i dope 1b thaJti^na subspecies 

<4f)Q> 43 

cg^acagcca scattaattc cc 22 



*210* 
<211> 

<213> 


44 
DNA 

Ar c i f i c i al eequenc & 


<220> 
<223> 


Forward primer for PCR amplification 
Arabidopaia thaliaoa eubspecifcs 




44 


aaccaaggca 


cagaagcg 


<210> 
<211> 
<212> 
<213> 


4S 
IS 
DMA 

Artificial .sequence 


<220> 





16 



123> Reverae prinier for PCR arciplif icat ion of MGA63 sstp marker in 

Arabidopaia thaliatna allbs-pBSies 



INS DOC ID: <WO 9919492A2TI_> 
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« A00> 


«5 


acccaagtga 


Lcgccacc 


< .£ J. SJ > 




<211> 


2X 


<2x2> 




<213> 


Artificial sequence 


<22Q> 




<223> 


Forward primer for PCR 




Axabidopsis tha liana bi 


<400> 




fcaccg&accB 


aaacacsaag q 


<210> 


41 


^21 1> 


22 


<:2l3> 


DNA 


<213> 


Art if icial sequence 


<220> 





IB 



21 



<223> Reverse primer fur PCR amplification of MGA24& SSLE> marker in 

Arabidopsis thaliana subspecies 

<40D> 47 

tctgtatctc ggtgaattct cc 2 2 

<211> 22 
<212> DNA 

<213> Artificial sequence 

<220> 

<22 3> Forward primer for PCR amplification of KTGA12& SSLP marker in 

Arabidopsia ttia liana aubspecies 

<400> 48 

ggtefcgttga cgtegtaagfc eg 2 2 

<214> 49 

<211> 22 

'-212 > DNA 

<213> Artificial sequence 
<220> 
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<221-* Reverse primer tor PCR ampi Ificacion of NGAi2B SSLP marker in 

Arabidopsis thalians aubsp-S-diee 



<•) DO > 


4 9 




atctcgaasc 


ctttagggag gg 


22 


<=2l0> 
<21L> 
<212> 
c21J> 


50 
22 

Artificial seqaence 




<220> 
*223> 


Forward primer for PCR aropli f ication of NGA2BO 
Arabi^opeiB thaliana subspecies 


sslp marker in 


<40D> 


SO 




ctgatctcac 


gyacaauagc gc 


22 


<211> 
<212;- 

<213> 


51 
20 
DNA 

Ar ti f icial sequence 




<22Q> 







<223=> ReverBe primer for PCR amplif lcaiion of NGA230 SSLP marker in 

Arabidopeis thaliana subspecies 

<400> SI 



<210> 52 

--511> 21 

<212s UNA 

<213> Artificial seqo&zice 
<220> 

<222> Forward primer for PCR amplification of SSLP nvarker in 

Arabidopai* tfta liana eu^apecies 

<400> 52 

ctccagttgg aagctaaagg g .21 

c210> S5 

<211> 2i 

<212> DNA 

<2l3> Artificial sequence 
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^220> 

<.22i> Reverse primer for PCR amplif ication o2 SSLP marker in 

Axabidopsis t ha liana subspecies 





SI 




tgttt tttag 


gacaaatggc 9 




c2lD> 
<211> 
<:212> 
c213> 


54 

20 

Art i f i C ia I e equen ce 




c220> 
c223> 


Forward primer for PCR amplification Df 
Arabidopsia t ha liana subspecies 


NGA1S8 £SLE> marker in 


<400> 


54 




ccttcacat^ 


caaaacccac 


20 


<210v 
r2ll > 
-:212> 
*213> 


20 
DNA 

Artificial sequence 




<220* 


Etevars^ primer for PCR amplif icacion ot 
Arabidopaia t ha liana aubcpecius 


P5A16B SSLP marker in 


<40D> 


55 




gcacataccc 


acaaccagaa 


20 


*211> 
<212> 
c213> 


o& 
20 
DNA 

Art i f i c i al a equence 




<220> 

<223:> 


Forward primer for PCR amplification of 
in Arabidopaia tha liana subspecies 


NGA1126 SSLP marker 


<400> 


56 




cgccacgctt 


ttcggtaaag 


20 
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<21Q> 57 
<211> 20 
--.212 > tiNA 

<zil:> Artificial sequence 

*22J> R*Vfcraa primer for PCR amplification of NGAH26 SSLP marker: 

in Arabidop&is chaliana subspecies 

<400> 5"? 

5C&c&gtCCa agtcacaacc 2< 



*21D> SB 

<211> 20 

<212> DMA 

<213> Artificial sequence 
*220* 

<22l> forward primer for PCR amplification of. NGA3 61 SSLP marker in 
Arabvtops j s Uicalinna subspecies 

<400> 58 

aaagagatga gaatttggac 20 
<210> 

<2ll> 23 

-:^15> DMA 

<£13 > Artificial sequence 

<223> Reverse primer ior PCR ainpl if ication of NGA3G1 SSLP marker 
Arabi dop* iff thai i Ana subspecies 

<400> 

&esst&tc<i&t atattaaagt age 23 



<2l0> 

<2ll> 18 

<212> DtfA 

<213> Artificial sequence, 
<220> 

<223> Forward primer for PCR amplification of HGA16S SSLP marker in 
Arab t dope is thai i ana sub«peci a 

<400> 60 



iNSDOCID: <WO 9919492A2TI > 









40 




ccgcc&actg 


cactgceg 


18 


<2H> 
<212> 
<2l3* 


61 
22 
DNA 

Artificial sequence 




<220> 
<223» 


Reverse prinver for pgr amplification of 
ftrabidopsis c ha liana subspecieB 


NGA158 SSLP marker in 


<40D> 


61 




gaggacatgc 


atagg&gcct eg 


22 


<210> 
<211> 
<2L2> 
<213> 


62 
20 
DHA 

Arc i £ i 0 ia 1 £<>qu ftn^. 




*220* 
<22i» 


Forward primer cor PCR anp^iCicacioa of 
in Arabidopsis r.ha liana subspecies 


AthBIo:? SSLP marker 


<400> 


62 




tgacctcccc 


ttccatggag 


20 


<2lD> 
<=211> 
<212> 
c2l3> 


22 
DMA 

Artificial sequence. 




<22Q> 
<223> 


Reverse primer for PGR amplification o£ 
in Arsbidopeis tha liana Bubspeciee ' 


AthBI02 SSLP marker 


<40Q> 


63 




ctaacagaaa 


cccaaagctc cc 


22 


<210> 
<212> 


64 
21 
DMA 

Arc illClni B cquSZlCc 




<220s> 
<223> 


Forward primer for PCR amplification of 
id Arabidopais t ha liana. Bubspecies 


AthtfBIQUE &&l»P marker 
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aqgcaaacgt ccjit ttcatt g 21 



<210> 65 

<21L> 2D 

<:212> DMA 

<211> Artificial, sequence 
<220> 

<223> Reverse prilher for PCR amplification of AthUBIOUE SSLP marker 
in Arabidopais tha liana subspecies 



<400> 


€5 






a<jat t tct cc 


20 


<210> 

<2ll> 
<2i2> 
*213* 


21 
DMA 

Artificial sft^ucnce- 




<22D> 


Forward primer for PCR amplif icat- i.<>rt of NCSA 172 
Arab i dope i e thes 1 1 ana subspec i e s 


SSXiP marker in 


<40D> 








ttatagcgtc c 


51 


<211> 
<212> 
<213^ 


67 
13 
DNA 

Artificial sequence 





<220> 

<223> Reverse priw^r for £cr amplification of UGai i ? SSLF marker in 

Arabidopsia thali&nA subspecies 

<400> 67 

cabccgaatg ceattgttc 19 

<211> 21 

<2l2> DWA 

<213> Artificial sequence 
<220> 
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42 








forward primer for PCR arnplif ic&cion of 
Arabidopsis chaliana subspecies 




^sip marfcer in 










gaaaaa^cgc 


fcactttcgtg g 




21 


c2l0> 
<211> 

<213> 


22 
DWA 

Art if icial tsequance 






c220> 
<223> 


Reverse primer for PCS amplification of 
Arab! dope is t ha Li ana subspecies 


NGA126 


SSLP marker in 


<40D* 








caagagcaat 


atcaagagca gc 




22 


*210> 
<211> 
<2l2> 
<213> 


20 
DMA 

Arc i f ic ial sequence 






<220* 
<223> 


Forward primer for PCR amplification of 
Arabiriopsis tbaliana subspecies 


NGA162 


SSLP marker in 




70 








gcatc cgagg 




20 


<2lO> 
<21L> 
c21*> 
«213> 


71 
12 
DMA 

Artificial sequence 






<220> 
<223> 


Reverse primer for PCR amplification o£ 
At abidop b i s thai iana subapec i e e 


NGA162 


SSLP marker in 




71 






ccctgtcact 


ctcttcctct gg 




2 2 


<210> 
<211> 
<212> 
<212> 


72 
21 
SNA 

Art i f i c i a l ©equenc e 
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Foirwac3 primer for PCR amplif Ication of 
Arabidopsia chaliana subspecies 


MGA6 SSLP marker in 


<400> 


72 




tggatttcrt 


cctctcttca c 


21 


<21Q:> 
<211> 
<2l2> 
<313> 


73 
21 
DNA 

Art t f icia 1 S equenc e 




*220> 
<223> 


Rav«rse pjrinier for pcr amplification of 
ArabidopBis thali&na subspecies 


SSLP marker in 




73 




atggagaacic 




21 


*2lO> 

*212> 
<213> 


7* 
20 
DMA 

Arti f icial sequence 




<22&> 


Forward primer tor PCR smpiif icst ion of 
Arabidopeis tha liana subspecies 


»£Al2 SSLP marker in 








aatgctgccc 


tcccctcctc 


20 


<211=> 
<212> 


75 
22 

Artificial ^qa^ce 




<223> 


fcevexae prinvex for PCR amplification of 
Arabidopsis thaliana aohspeciea 


MGAI2 SSLP marker* in 




75 




tgatgctctc 


cgaaacaaga gc 


32 



3NSDOCID: <WO 9919492A2TI_> 
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<210> 76 
<21L> 21 
-212> DMA 

■=31 3 > Artificial sequence 

<223> Forward primar for pcr atnplif ication o£ NGA8 SSLP marker in 

Arabidopsie thaiiana subspecies 

<400> 76 

gagggcaaat ctttattecg g 21 

<2lD> ni 

<2X1> 22 

<212> DWA 

<2l3> Artificial sequence 
<22D> 

<22S> Reverse primer ir,r PCR amp2 i f i cation of WGA6 SSXP marker in 
Arabiriopsis cfcsl.ieiiui si>bepf>cie& 

<4O0> 77 

tggctttcgt ttaeaaacat cc 22 

<2i0> 73 
<2*2> DMA 

<2l3> Artificial Bequeace 

<22Q> 

<223> Forward primer for pcr sunpli f icacion of NOA11D7 Sslp marker 

in Arabirtopeie thaliana subspecies 

<400> 79 

gegaaaaaac aaaaaaafccc a 21 

<210> 79 

<2X1> 21 

<212> DMA 

<213> Artificial sequence 
<22Q> 

<223> Reverse primer for PCR amplification of WGA1107 SSLP marker 
in Arabidopsis chaligna sub species 



<400> 79 
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cgae^assccg 










21 


<210> 












<2ll> 


21 






















c213> 


Artificial sequence 






















<223> 


Arsbidopsifi cha liana subspecies 






S3 LP 


marker in 


<4 D0> 












gaaatccaaa 


ccccagagag g 








21 


<2iD> 


Bi 






















<212> 


SNA 










<213> 


Artificial Bequecice 










*220> 












<221* 


r\evet&e punier iOI 1X_K Snlpl 111 Co r 1 OH 

Arsfcd daps is r ha liana subspeciea 


ox 


NGA22& 




marker in 


- ^1 V V r' 












tctccccact 


agttttgtgt rc 








^ «* 


*210> 


B2 










*211> 


13 










c2l2> 


ONA 










<213> 


Artif i cisl ee que ID Cc 










<220> 












<223> 


Forward Drinier for pch anit^l 4 f i r^t -5 ^« 
Arabidopati© trial i an a subspecies 


Ul 






tor r Rcr in 


<400> 












taccgtcaafc 


tccatcgcc 








i& 


±210 > 


B3 










<211> 


22 










<:2l2> 


DNA 










<213> 


Artificial eei?ue;nce 










<220> 












<223> 


Reveraa prima r for PCR amplification of 


NGA24? 


SSLP 


marker in 



Arab i dopes is trial iana subspecies 
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4G 

<40G> S3 

ggatccc^aa ctgtasaatc ccr 2 -> 

<210> 3-i 

<212* DNA 

<2i$> Artificial sequence 

<220> 

<2*3> Forward prinusr for PCR amplif icaLion of CA?2 SSLP marker in 

Arabidopsis thaliana subspecies 

<400* B4 

eatcccagta accaaacacs ca 25 

<2L1> 20 
<2L2> DNA 

<2L3> Artificial .-sequence 

<220* 

<223> Keverse primer for PCR amplification of CJ0 2 ssl-p marker in 

Arabi «iops 1 R- cha 1 L aim sufospec 

<40o* as 

cccagtctaa ccacgaccac 20 

<2l0* Rh 
<:2il> 2Q 
<212> DNA 

<2i3> Artificial sequence 

<220> 

Forward primer for PCR ampl i£i Dation of MGA1SJL SSLfc marker in 
Arabidopaie thai i ana subspecies 

<400> 85 

gtttcgggaa gttttgcegg 20 

<210> ^7 

<21i> 24 

<212> DMA 

<2l3> Artificial sequence 
<22D* 
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-.223 > 


Reverse pariroer for ^CK amplif icatit>n of 
Arabid&peis t ha 11 ana eubapecies 


NOA15-1 


SSLF 


marker in 


<40O> 










cagtctaaaa 


9C3-agagtst gaLg 






24 


<210> 
<2I1> 
<212> 
<213> 


ea 

22 
DMA 

Artificial sequence 








<220> 
<223> 


forward prim&r fox PCfc ampiif i cat ion of 
Arabidopsia thai a ana subspecies 


NOA106 


SSL.P 


ina2r}cer in 


<*00> 










ge tatggagt 


ttciagggca eg 






2? 


<2ll> 
<212> 
<213> 


20 

Artificial sequence 








<220> 


Reverse pritr^r for PCR amplif icaclon of 
Arab i dopa is c ha 1 i ana 5 ubspec ids 


NGA1 0 6 


SSLP 




<400> 










tgccc eaten 


tgttcttctc 






20 


<210> 
<211> 
< 2l2> 
<213> 


20 
DNA 

Artificial sequence \ 








c220> 
<223> 


Forward primer for FCfc amplif ic&tion of 
Arab i dopa i a thaliana aubspeciea 


NGA13 9 




marker in 




30 








agagc&acca 


qatccgatgg 






20 


<210> 
<2I1> 

c213> 


91 
21 
DBA 

Artificial sequence 
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4S 



A U > 








< 2 2' "5 > 


Reverse primer for PCR ampl 1 f 1 cat i on 
Arabidopaia t ha liana subspecies 


of 


NGA13 9 s$i*p rnarkftr in 


< 400 > 








ggtttcgttc 


cactacccag g 




21 


<210> 


92 






<211> 


22 








DNA 






<213> 


Artificial sequence 






<220 > 








<223> 


forward primer for PCR amplification 
Arabi<5opBis tha liana subspecies 


of 


NGA9 6 SSLP marker in 


*400> 


92 






ggsgaaaatg 


tcaccctcca cc 




22 


<2i0> 








<2X\> 


20 






<212> 


DNA 






<213> 


Arti f icial sequence 














<223> 


R*v*rse primer for PC* amplification 
Arab i dopa i a t ha 1 i ana «?ub£p6cie a 


Of 


HGA7G SSLP iftarkcr in 




93 






aggcat^gga 


gecatttacg 




20 


<210> 


94 






<211> 


20 






<212> 


XXNA 






<213> 


Artificial sequence 






<22D> 








<223> 


Forward prima r for pcr amplification of 
irt ArabidopaiB tbaliana subBpeciee 


ATHSOX91 £5 LP inarker 


<400> 


94 






ccccaccaat 


catgeaaatg 




20 
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<210> $5 
*212> DHA 

<:213> Artificial sequence 

c220> 

<223> fte verse primer i or PCR acnpli fiction of ATKSOISi SSLP marker 

in Arabiriop&is tbzil iana subspecies 

<400> 95 

tgatgttgat ggagacggtc a .21 

<210:> 96 
c21I> 22 
<212> DMA 

<213> Artificial sequence 

<220> 

<22i> Forward primer for PCR amplifier tier, of NOA*2 9 SS"LP marker in 

Ar&bidopsis th*i liana subspecies 

<400> 

tcaggaggaa ctaaa-gtgag $g 22 

<210> 97 
*2ll v 22 
<312> DNA 

<213> Artificial sequence 

<223> Reverse primer fox PCR ar&pli fixation Qf NSA13 9 SSLP ?narker in 

Arabidopsis thai lama subapecieo 

*40Q> 97 

cftcaccgaag atggtcttga gg 22 



<210> 


93 


<211> 


$0*2 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<223> 


Genomic DNA 



<400> 97 

utttctggtt get acaata aaggtatacg gttttatgec atcaatataa ctatautaa 6d 
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aagaaacgaa 


agatatatat 


trgttttccca 


tttateaaac 


s a a a caac a a 


gactttctct 


12 0 


ttacttttta 


C3 ttggtcaa 


caa&.itacaa 


gataaacgae 


ancg^t taat 


cattte-scaa 


lao 


ttttacccet 




cctagaaccr 


tctccatctt 


cgcaagcaca 


gectgattag 


240 


gaacagcttt 


accactcfcca 


tattcccgaa 


ctacctgagt 


cctctcattg 


atctgtctcg 


3 00 




ttgtgacatc 


ttcttetcca 


atctcgctc t 


eegtatcatc 


aacctcacct 


3«0 


ctgccttcac 


acgatccatc 


gccgcaggct 


ctgtttcc.ee 


ttceagcttc 


ttcgtgctaa 


420 


tCaCeggAac 


cgccgtagac 


ttcccctttt 


tgttcgaacc 


ggcatcgaat 


ttcttaacqg 


480 


tttgaaccgc 


gaeacegtte 


ctcagagctg 


cgttaaccgc 


ttceggateg 


egtaggtet t 


34 0 


ggctcttttg 


CC ttgatctg 


tggagaacta 


ctggctccea 


gtcttgtgtt 


actgctcctg 


GQQ 


ggtatctgct 


cggcatcgtc 


gatgaattga 


gagsaaggaa 


caacgegaaa 


attttattaa 


660 


tctgagc" t 


gaaattgaga 


aacgstgaag 


aegaagaatg 


t cgttgagaq 


gatcgegata 


720 


tctacscata 


egaagategg 


t ttcxggaga 


at tcgatcac 




attttegtet 


?ao 


ctggaaegtfc 


ct fcagagatg 


atrgacgacg 


tgtcattatc 


rgatttgcag 


ttaaccaatg 


940 


ctttttgggt 


tggattcgtg 


gtacaceata 


ttatccgatc 


^ggctcaatg 


gtttieAtata 


9 00 


aattcggctc 


tcggttcggt 


tatgagtcat 


CdtLaaaatt 


aagctaacca 


aaaatttccg 


960 


taaaattcst 


ttcggtttca 


actcggatcc 


cttacttccA 


gAaccgaatt 


Attcgaaacc 


1020 


ggggctagcc 


gaaccgaata 


ccaatgcctg 


attga^tege 


uggctagaaa 


gatccaaegg 


ioeo 




tagaacataa 


aceggaeggt 


catCaaagcc 


tcaaagagtg 


eacagtcaac 


1140 


aaaaaaagtt 


gagecctgag 


gagcategtt 


tccgccrattt 


ctacgacgca 


a-ggegaaaa t 


1200 


ttttggcgcc 


aatotctccc 


eectttcgaa 


ttctctcagc 


tcaaaacatc 


gtttctctct 


1260 


cactetcect 


cacaattcca 


aaaaatgcag 


cgecagagat 


cgactttgtc 


tttcctooaa 


1320 


aaacccacgg 


cggcgactac 


gaagggtttg 


gtttccggcg 


atgetgetag 


Cggcgggggc 


13B0 


ggcagcggag 


accacgattt 


aatgegaagg 


aaggggatgc 


taaaggegae 


gcttctgcac 


1440 


gttttgcegt 


ttcgaaatct 


gtcgatgagg 


ttagaggaac 


ggacactcca 


ceggagaagg 


1SO0 


ttccgcgecg 


tgtcctgccg 


tctggattta 


ageeggctga 


atccgccggt 


gaegcttegfc 


1560 


ccctgttctc 


caarattatg cataagtttg 


taaaagtcga 


tgatogagat 


tg^tctggag 


1520 


agaggtacta 


afcettcgatt 


ctcttaattt 


tgtcaccttt 


ageeggaaga 


aga agate eg 
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tgtaat ttgt 


cgt attcgtt 


gg^gagat tc 


tgatt.act.gc 


attggatogt 


tgtctacaaa 


L740 


tt.c tcaggag 


ccg Agaagac 


gfct.^ ttcc^e 


bgaatgatcc 


atctct atgt 


atgaaggct a 


LSOO 


a tga tg c l3C 


Lcct raa lll 


cgttccaaca 


atggtaaaac 


Ccaagaaaga 




1360 


c u a<j t u c cag 


gagsgct 


gaac c t aga c. 


c agtagaaga 


c at aggagta 


ga&ggcgatg 


1 32 0 


c ccccggccc 


agaaacs cca 


ggga tgcgt c 


c acgt gc t tc 


ccgcttgaag 


cgagttctgg 


1 980 


agga&gaaat 


gactt&taag 


gaggafcaagg 


ttcctgtatt 


ggactc&aac 


aaaaggctga 




aaatgctcca 


ggatccggtt 


tgcggagaga 


agaaagaagt 


aaacgaagga 


accaaatttg 


2100 


aatggcttga 


gtcttctcga 


atcagggatg 


ccaatagaag 


Acgtcctgat 


gatccccttc 


2160 


acgatagaaa 


gaccttacac 


atsccacctg 


atgttttca* 


gaaaaegtet 


gcatfacaaa 


2220 


egcaatatcg 


gagcgrtaag 


agcgaataca 


tggacattgt 


gctttccttt 


aaagtggtta 


2250 


gtaactatta 


atctagtgtt 


caat ccatcc 


cctcaatgcg 


atttgcccac 


t tacatctgt 


2540 


tcacgtcatg 




gggaaeittt t 


atgagctgta 


tgagctagat 


gcggaatcag 


24 00 


gtcacaagga 


gcttgactgg 


aagscgacca 


tgagtggtgt 


gggaaaatgc 


agacaggcaa 


24«D 


attagttgaa 


acaactggcC 


egcttgaatt 


attgtgtcta 


taaattttga 


caccaccttt 




tgtt tcaggt 


tggtacctct 


gaaagcggga 


tagatg&ggc 


agegcaaaag 


otjstttigctc 


25&0 


grgggtaagg 


gaaccatcat 


accttatgga 


atccgtttac 


tgetactteg 


gctaggatct 


2 6 40 


asg aaa tgg3 


aatcac&tca 


agcatcatta 


gttaggatcc 


tgagaac&ca 


ggatgtttt c 


2*7 00 


ttat t cgt t a 


ta t aataag t 


ct tctcac ca 


aggagtaaca 


aacaaaactt 


gcacaacat c 


2 7 60 


c.gu g c gcc.ca 


c tggcaagg c 


atatataccc 


a gc taacr t % 


tgctagttca 


ctgtagcaac 


2 8 SO 


ag v x. & cgga c 


A <*W MB A k_ ^mAm lb. 

aacatatgt t 


tacttgtatg 


t ggtacc etc 


^ W b» 

attxtgtctc 


tcatggaggc 




u c u c aagc c c 


m A* aw a A 

t-g c Lga a a 


aVM am. a«fe au> 

ccggacagtt 


acatatgc t t 


ccaacagaaa 


ct.agcat.gca 




gattcatacg 




ctaccaatta 


tgtat tgaca 


cact cgt tg t 


ttcttttgaa 


•4 f\ ft. 


agatataaag 


ttggacgaat 


cgagcagcta 


gaaacatctg 


a ccaagcaaa 


agecagaggt 


3 06 Q 


gctaatactg 


caagttttct 


tggataggtc 


aaggagagtg 


ttgcagactg 


tttttgatca 


2120 


tttctttttc 


tgtacattac 


tttcaegctg 


caac&aactc 


aatggctatt 


ctggtctgat 


3180 




ctccaaggaa 


gccagttcag 


gtattaactc 


caccaaeagc 


aagcgaggga 


3240 


aacateggge 


ctgatgccgt 


ccatcttctt 


gctataaaag 


aggtttgtta 


tttacttatt 


3300 
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tatcucatca tgttcagttc atccaagtcc 
ccAtca^cC gfcgtaaagga tttggaatta 
tgca&gaggt tcccttgaaa agacctgttt 
ttttaagtga atcccctact tccccacaat 
tatcattcct t^ttactgca tccccctctt 
tctcgtgtgc! tggagaaaat gatagctgat 
tgcteaggaa etgcetttgg ttacatcgcc 
Gcetcacttt. agatca&aat ggagctacaa 
gtcgactgtg ctgccttgag gttttgggct 
gctcttggag cgttattgat gcaggtaagr 
cgacttcccg tgcatatatt tgggt:r r gcag 
ettttrcagg tctctccaaa ggaagtgtta 
ccagttgtct tgtcaaacag aatctaaggc 
atgttgaaat attgcagggc tatcaagaga 
gacaggcacc acttcagcag gcaagctsac 
tctcrtaaac atfcgetaatg tagatgatgt 
gtacagcxgg ctccagtacc acaagcaatg 
ataatagaat ctaacggata ctttaaaggt 
ggtctsaatg aatgtgatgt tgcccttagt 
aggeeaaagg bgtgttggcc tgtttagttt 
ttcstaacfct aeagtttcta tctacttgca 
ttttccatac caagtfcfcaea ggggttgtct 
tgagatattt aacaatagct gtgatggegg 
ataacctcaa ctagagggca gacatagaag 
agtaagtaat: agccgatttt tgtttaccta 
ctgtgttagc ccaactggta gcgactett 
tgtagaaagc atcaataaae ggcfcfcgatgt 



52 



tgaaaaatta 


cactcttctt 


taccastctt 




gaaaatcatt 


atttgatgct 


ctgtttt&ba 


3420 


aagatcctCt 


gcacecgaaa 


aactcaatct 


3430 


c^aecatagtc 


cgcaactgca 


tgccaagcaa 


3 540 


tcteaatgac 


cattgtctat 


gttgtgtctg 


3€QQ 


ccaagctgta 


cat t a teat g 


atfcaagtagr 




taatggtttg 


atgtcaatt t 


ttcttctgaa 


3720 


aagtgttcaa 


ctgtgtatgg 


atttgettte 


37B0 


gggtctatca 


gcgatgatgc 


atcatgtgct 




aagtcytatdc 


trgtatctcac 


gtgtaccatg 


2300 


gaactaattc 


tgaatcacca 


tttggtatgt 


3960 


tatgacagta 


aaggtaaaet 


gcttgtatcg 


4020 


aaatgacacL 


ggttaattta 


aagtgcatac 


40-90 


agcacaaaag 


gcLctaagga 


aatatacget 


4140 


tgac&attta 


acegctcacc 


gaatgatagg 


4200 


ttatgtttca 


at^taatagg 


gtctacggcg 


42b0 


^gggatacag 


atgctgctgg 


agttagsaac 




tcttetgaat 


catggaactg 


tgctgttgat 


43S0 


gctcttggag 


a^ctaattaa 


tcaccrgtct 


4440 


rtgcttttca 


ca&attaagc 


aaaggaactt 


4500 


gctagaagat 


gtaettaagc 


atggggatat 


4560 


cagaattgat 


ggecagaega 


fcggtaaatct 


4620 


tccttcaggc 


aagcgcafcafc 


ttcttttttg 


4680 


gaaaaattcfc 


aanaettcgt 


acggatctcc 


4?40 


tgtagggacc 


ttgtacaaat 


afcctfcgataa 


4B00 


aaggaattgg 


atctgccatc 


cactcaaaga 


4B60 


agttgaagaa 


steaeggcaa 


actcagaaag 


492 0 
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MDgcaaatc 


actggccagt 


atctccacaa 


acttCcagac 


Ctaaaaagac 


tgctcggacg 


4350 


catcaagtct 


agcgt tcgat 


catcagcctc 


tgtgttgcct 


gctctcctgg 


ggaaaaaage. 


504 0 


gctg»aacaa 


cgagtaagta 


tcaatcacaa 


gttttctgag 


Caatgccttc 


catgagcagC 


510€ 


acaggactaa 


aacattacgg gtctagctaa agactgttct 


ccttcttttg 


caatgtctgg 


516D 


tcattcatta 


catttctcec 


aacttsttgc 


atbgcaggtt 


aaagcatttg ggcaaattgt 


5220 


gaaagggttc 


agaagtggaa 


ttgatctget 


gttggctcta 


cagaaggaat 


Caaat.at.gat 


529 0 


gagttfcgctt 


tataaactcc 


gtaaacttcc 


tatattagta ggaaaaagcg 




5340 


attxctbbet 


caattcgaag 


cagecataga 


tagegacbtt 




aac?t9cccat 


5400 


ctatcttcca 


tacetcacaa 


caaaatgtct 


gtc acta etc 




atatggctta 


54*0 


gatctcaact 


casaccccga 


ggatcctaaa gggatttgct 


tttbatbcefc 

V M V ^^^^ W ^iT^P A* W 


aabgtttbtg 


S520 


gatggtttga 


tttatttcta 


actcgaactt 


aCtaatcttg 


tsc cagaac c 


aaQatottcf a c 


SSBO 


3§acgaaaac 


gccgaaactc 


tcacaatact 


tatcgaactc 


cttatogaaa 


gagcaactca 


SSAQ 


atggectgag 


gtcsctcaca 


ccataagctg 


cctagaegtc 


ctgagatctt 


ttgeaatege 


5*700 


agcaagtctc 


tctgctggaa geatggccag 


gectgtcatt 


tttcccgaat 


cagaagetac 


5760 


agatc&gaac 


cagaasacaa 


aagggccaat 


acttaaaatc 


caaggactat 


ggcatccatt 


5620 


tgcagttgca 


gccgatggtc aattgcctgt 


tccgaacgat 


ataeeectfcg 


gcgaggctag 


S6BQ 


aagaagCa§t 


ggcagcactc 


atccccggtc 


attge taccg 


aegggaccaa 


acatgggegg 


5940 


aaAatcaact 


cttcttcgtg 


caacatgtct 


ggccgttatc 


cttgcccaag 


tttgtatact 


b ooo 


cgttagataa 


ttactctatt 


cfcttgeaatc 


agttcCteaa 


catgaataat 


aaat be tget 




ttcLgtccgc 


agcttggceg 


ctacgcgccg 


tgtgagtcdt 


gegaaatetc 


ectcgtggat 


o 12D 


actatcttca 


caaggcttgg 


cgcatctgat 


ag&atcatga 


caggagagag 


caagttttgt 




tcbcaaaats 


ccaattcctc gaactactta 


ctcagatttt 


gtctgattgg 


acaaggfcggt 




tttgettttt 


trtaggtRCC 


tttttggtag 


aatgeacega gaeagegtea 


gttcttcaga 




atgcaactca ggatt cacta gtaatccttg 


acgaactggg 


cagaggaacc 


agtactctcg 




atggatacgo 


cattgcatac 


tcggtaacct 


gctcttctcc 


tbcaacttat 


aeetgdtgat 




caacaaaaac 


atgcaattca 


tfcttgctgaa 


act tat t gat 


tbatatcagg 


ttcttegtea 


6$BO 


cctggtagag 


aaagetcaat 


gteggatgct 


cttCgcaaca 


cattaccacc 


ctctcaccaa 


6540 
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ggaactegcg tctcacccsc gtgtcacctc gaaacacatg gcttgcgcat teaaatcaag 

atctgattaC caaccacgtg gttgtgatca agacctagtg ttcttgtacc gtttaaccga 66$ Q 

gggagctrtgt cccgagagct acggacttca agtggcactc acggctggaa taccaaaeca 5^20 

ageggcegaa acagcatcag gtgctgctca agccatgaag agatcaatrg gggaaaactc 67B0 

caagtcaagC gagccaagat ctgagttcfcc aagtctgcat gaagactggc tcaagtcatt £340 

ggtgggcatt Cctogagtcg cccacaacaa tgcceccatt ggcgaagatg aetacgacac $900 

tttgttttgc ttatggcatg agatcaaatc ctcttactgt gttcccaaafc aaatggctat 6*€0 

gacataacac tatctgaagc tcgttaagtc ttCtgcttCt CCgatgtttA ttcctcttaa 702 Q 

aaaatgceta tatatcaaaa aatfcgtttec tcgattataa caagattata tatgtatctg 70 8 0 

tcggtttagc tatggtatat aatatacgta CgtCcatgag attggtcaag agaaatacte 7 14 0 

acaaacagta cactaagaag gaaaeatgtt tatgcattaa cttaagcttc aagataaact 72 00 

gcaaataacc tcgaccaaag tcgcaaagac caaaeacaaa ttacaaaact tauagactt 72 GO 

aagt&Ctgaa Ctccdtaaaa ccaaaaaaaa aaacagaaca tattttgttg catctacaaa 73 20 

caacacaaac ctacatagcc tataacttac tcatcactga gattaacafcc agaaecatcc 7^eo 

tccatctctt catcttcact ctcatcatca tcaccaccac catgatgatt ctcctcctct 744 0 

tcacgtaacc tagcaatctc actctgagcc ctatcaacaa tccgcctctt ctgcaactcc 7500 

aaatctctct gaaaatcagc tctcatcttc tccaacecct tcactcgctc ccecttaetc 7 5*Q 

ttctccatct tcfccataaae ettcecaaac ctctcaacag aatccgccaa cafccttatac "7 620 

gaagcagcgt cattaacett ettcctetcg tactcaacct catcatcctc atcctcctcc 76 BO 

tcttcagaat caccaggact ate cat cat c tcaceaaaec cat tag act t. atctaaataa 7740 

accttagtgt tcataaacac aaactcacct gaatcaacac cacaagceaa acctaaatcc 760 0 

gacttgg^cg aaacacaaag caacatatcc aacttattga aaaacgacca tttacttgaa 7 $6 0 

cctaaacctg atttctcaao cttaatcttc tetttfcetae acetcctcte caagtcatca *7&2 0 

atcactetce tacattgegt ctcagatttc tccatcctta gccecccact eactttctca 7 93 0 

gctacttcat tccaatcctc gtccctcaaa ctccttctac ecaattgeaa aaacctatct 0 04 0 

CCCCaaactt Caagcaacac as 0 0 62 
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